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AB The present invention provides a collection or library of 8051 

nucleic acid contig sequences assembled from expressed sequence tag 
or cDNA libraries isolated mainly by sequencing by hybridization 
(SBH) , std. PCR, Sanger sequencing techniques, and in some cases, 
sequences obtained form one or more public databases. The cDNA 
libraries are from human tissue sources and nearest neighbor 
sequence homologies are provided. The invention also relates to the 
proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and 
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AB Cbmpns. and methods for the therapy and diagnosis of acne vulgaris 
and. other related conditions are disclosed. Compns . may comprise 
one or more Propionibacterium acnes proteins, immunogenic portions 
thereof, or polynucleotides that encode such portions. Thus, 
overlapping clones representing .apprx.8.6 full-length genome equiv. 
from a P. acnes genomic library were aligned to form 299 linear 
contigs. These 299 contigs represent a total assembled length of 
about 2,656,860 nucleotides covering >90% of the P. acnes genome. 
Six-frame translation is performed in order to predict 28,913 open 
reading frames encoding P. acnes polypeptide sequences .gtoreq.50 
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amino acids in length. A therapeutic compn. may also comprise an 
antibody that binds a P. acnes protein, antigen-presenting cells 
that express a P. acnes protein, or a T cell that is specific for 
cells expressing such a protein. Such compn s . may be used, for 
example, for the prevention and/or treatment of acne. [This abstr. 
record is the first of six records for this document necessitated by 
the large no. of index entries required to fully index the document 
and publication system constraints.]. 
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AB The present invention provides 54 97 novel nucleic acids, 54 97 novel 
polypeptide sequences encoded by these nucleic acids, and their uses 
for diagnostic, therapeutic, and research purposes. A collection or 
library of the novel nucleic acid sequences were assembled from 
expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization, and in some cases, sequences obtained from one or 
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WO 2000- 
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AB The present invention 
polypeptide sequences 
thereof. A plurality 
libraries prepd. from 



provides 17 68 novel nucleic acids, novel 
encoded by these nucleic acids, and uses 
of novel nucleic acids were obtained from cDNA 
various human tissues and in some cases from a 
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genomic library derived from human chromosomes using std. PCR, 
sequencing by hybridization (SBH) sequence signature anal., and 
Sanger sequencing techniques . The contigs or nucleic acids of the 
present invention were assembled using an EST sequence as a seed, 
with a recursive algorithm used to extend the seed EST into an 
extended assemblage by pulling addnl. sequences from different 
databases that belong to this assemblage. Full-length gene cDNA 
sequences and their corresponding protein sequences were generated 
from the assemblage. 

IT 352374-90-4 352374-91-5 352374-92-6 

RL: ANT (Analyte) ; BOC (Biological occurrence) ; BSU (Biological 
study, unclassified); BUU (Biological use, unclassified); PRP 
(Properties); ANST (Analytical study); BIOL (Biological study); OCCU 
(Occurrence); USES (Uses) 

(amino acid sequence; human nucleic acids and polypeptides) 
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INVENTOR (S) : 
PATENT ASSIGNEE (S) 
SOURCE: 



DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2001:241691 HCAPLUS 
134:261275 

Use of a BMP protein receptor complex for 

screening bone metabolism actives and cells 

co-transfected with a type II BMP receptor and 

type I BMP receptor 

Rosenbaum, Jan Susan 

The Procter & Gamble Company, USA 

U.S., 85 pp., Cont .-in-part of U.S. Ser. No. 

334,178, abandoned. 

CODEN: USXXAM 

Patent 

English 

2 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



US 6210899 Bl 

WO 9614579 Al 

W: AM, AU, BB, BG, 

KG, KP, KR, KZ, 

PL, RO, RU, SG, 

RW: KE, LS, MW, SD, 

IE, IT, LU, MC, 

ML, MR, NE, SN, 

AU 9539713 Al 

AU 710559 B2 

EP 789844 Al 

EP 789844 Bl 

R: AT, BE, CH, DE, 
SE 

JP 10510047 T2 

AT 213064 E 

PRIORITY APPLN. INFO.: 



20010403 

19960517 
BR, BY, CA, 
LK, LR, LT, 
SI, SK, TJ, 
SZ, UG, AT, 
NL, PT, SE, 
TD, TG 

19960531 

19990923 

19970820 

20020206 
DK, ES, FR, 

19980929 
20020215 



US 1995-462467 19950605 
WO 1995-US14027 19951030 
CN, CZ, EE, FI, GE, HU, IS, JP, 
LV, MD, MG, MK, MN, MX, NO, NZ, 
TM, TT, UA, UZ, VN 

BE, CH, DE, DK, ES, FR, GB, GR, 

BF, BJ, CF, CG, CI, CM, GA, GN, 



AU 1995-39713 



19951030 



AB 



EP 1995-937676 19951030 

GB, GR, IE, IT, LI, LU, NL, PT, 

JP 1995-515373 19951030 
AT 1995-937676 19951030 
US 1994-334178 B2 19941104 
US 1995-462467 A 19950605 
WO 1995-US14027 W 19951030 
The present invention relates to a method for detg. whether a compd. 
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IT 



is capable of binding to a BMP receptor kinase protein complex. The 
invention further relates to a method for detg. the concn. of a BMP 
receptor ligand in a clin. sample. The invention also relates to a 
host cell co-transfected with an expression vector comprising a DNA 
sequence that codes for the BMP receptor kinase protein BRK-3 and an 
expression vector comprising a DNA sequence that codes for a BMP 
type I receptor kinase protein. The invention further relates to a 
host cell co-transfected with an expression vector comprising a DNA 
sequence that codes for a sol. or incomplete BMP type I receptor 
kinase protein and a sol. or incomplete BMP receptor kinase protein 
BRK-3. The invention further relates to a method for detg. whether 
a test compd. produces a signal upon binding to a BMP receptor 
protein complex. 
332001-59-9P 332001-60-2P 

RL: BPN (Biosynthetic preparation) ; PRP (Properties); THU 
(Therapeutic use); BIOL (Biological study); PREP (Preparation); USES 
(Uses) 

(amino acid sequence; use of BMP protein receptor complex for 
screening bone metab. actives and cells co-transfected with type 
II BMP receptor and type I BMP receptor) 
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2001:208400 HCAPLUS 
134:248841 

Crystalline three-dimensional structure of a 

metallo . beta . -lactamase IMP-1 from Pseudomonas 

aeruginosa' and its complex with the inhibitor 

SB-252619, and applications to drug design 

Abdel-Meguid, Sherin S.; Concha, Nestor 0. 

SmithKline Beecham Corporation, USA 

PCT Int. Appl. , 243 pp. 

CODEN: PIXXD2 " 

Patent 

English 

1 



PATENT NO. 



KIND DATE 



WO 2001019971 
W: CA, JP, 
RW: AT, BE, 
NL, PT, 

PRIORITY APPLN. INFO 
AB 



20010322 



APPLICATION NO. DATE 



WO 2000-US25340 20000915 



CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 



IT 



Al 

US 
CH, 

SE 

: US 1999-154749P P 19990917 

Native crystal structure of a novel Pseudomonas aeruginosa metallo 
.beta. -lactamase IMP-1 and crystal structure of the P. aeruginosa 
metallo . beta . -lactamase IMP-1 complexed with the inhibitor 
SB-252619 are disclosed. The invention provides direct information 
on the specific role of the residues in the active site responsible 
for the binding of inhibitors, substrates and substrate analogs. 
This information could be used in search for new antibacterial drugs 
and in designing drugs useful for inhibiting the P. aeruginosa 
.beta . -lactamase IMP-1. 
331287-05-9 
RL: PRP (Properties) 
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(unclaimed protein sequence; cryst . three-dimensional structure 
of a metallo . beta . -lactamase IMP-1 from Pseudomonas aeruginosa 
and its complex with the inhibitor SB-252619, and applications to 
drug design) 
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DOCUMENT NUMBER: 135:15597 

TITLE: Molecular and functional characterization of a 

family of rat brain T-type calcium channels 
AUTHOR (S) : McRory, John E.; Santi, Celia M. ; Hamming, Kevin 

S. C; Mezeyova, Janette; Sutton, Kathy G. ; 

Baillie, David L.; Stea, Anthony; Snutch, 

Terrance P. 

CORPORATE SOURCE: Biotechnology Laboratory, University of British 

Columbia, Vancouver, BC, V6T 1Z3, Can. 

SOURCE: Journal of Biological Chemistry (2001), 276(6), 

3999-4011 

CODEN: JBCHA3; ISSN: 0021-9258 
PUBLISHER: American Society for Biochemistry and Molecular 

Biology 

DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Voltage-gated calcium channels represent a heterogeneous family of 
calcium-selective channels that can be distinguished by their mol., 
electrophysiol . , and Pharmacol, characteristics. We report here the 
mol. cloning and functional expression of three members of the low 
voltage-activated calcium channel family from rat brain (. alpha. 1G, 
.alpha. 1H, and .alpha. II). Northern blot and reverse 
transcriptase-polymerase chain reaction analyses show .alpha. 1G, 
.alpha. 1H, and .alpha. II to be expressed throughout the newborn and 
juvenile rat brain. In contrast, while .alpha. 1G and .alpha. 1H mRNA 
are expressed in all regions in adult rat brain, .alpha. II mRNA 
expression is restricted to the striatum. Expression of .alpha. 1G, 
.alpha. 1H, and .alpha. II subunits in HEK293 cells resulted in 
calcium currents with typical T-type channel characteristics: low 
voltage activation, neg. steady-state inactivation, strongly 
voltage-dependent activation and inactivation, and slow 
deactivation. In addn., the direct electrophysiol. comparison of 
.alpha. 1G, .alpha. 1H, and .alpha. II under identical recording 
conditions also identified unique characteristics including 
activation and inactivation kinetics and permeability to divalent 
cations. Simulation of .alpha. 1G, . alpha. 1H, and .alpha. II T-type 
channels in a thalamic neuron model cell produced unique firing 
patterns (burst vs. tonic) typical of different brain nuclei and 
suggests that the three channel types make distinct contributions to 
neuronal physiol. 

IT 342874-40-2 

RL: BAC (Biological activity or effector, except adverse); BOC 
(Biological occurrence); BPR (Biological process); BSU (Biological 
study, unclassified); PRP (Properties); BIOL (Biological study); 
OCCU (Occurrence); PROC (Process) 

(amino acid sequence; cloning, sequence and characterization of a 
family of rat brain T-type calcium channels) 
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2001:106055 HCAPLUS 
134:188985 

Human expressed sequence tags and primers for 

synthesizing full-length cDNAs 

Ota, Toshio; Isogai, Takao; Nishikawa, Tetsuo; 

Hayashi, Kohji; Saito, Kaoru; Yamamoto, Junichi; 

Ishii, Shizuko; Sugiyama, Tomoyasu; Wakamatsu, 

Ai; Nagai, Keiichi; Otsuki, Tetsuji 

Helix Research Institute, Japan 

Eur. Pat. Appl., 2527 pp. 

CODEN: EPXXDW 

Patent 

English 

9 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



EP 



1074617 
R: AT, 
PT, 
1205549 
R: AT, 
PT, 

PRIORITY APPLN. 



EP 



BE, 
IE, 

BE, 
IE, 
INFO 



A2 20010207 
CH, DE, DK, ES, 
SI, LT, LV, FI, 

Al 20020515 
CH, DE, DK, ES, 
SI, LT, LV, FI, 



FR, 

RO 

FR, 
RO, 



MC, 



MC, 



EP 2000-116126 20000728 
GB, GR, IT, LI, LU, NL, SE, 

EP 2000-948282 20000728 
GB, GR, IT, LI, LU, NL, SE, 
MK, CY, AL 
JP 1999-248036 A 19990729 
JP 1999-300253 A 19990827 
JP 2000-118776 A 20000111 
JP 2000-183767 A 20000502 
JP 2000-241899 A 20000609 
US 1999-159590P P 19991018 
US 2000-183322P P 20000217 
WO 2000-JP5065 W 20000728 
AB Primers for synthesizing full-length cDNAs and their use are 

provided. The invention provides 5'-end sequences for 5602 partial 
cDNA sequences (expressed sequence tags, ESTs) and 3 1 -end sequences 
for 4970 of these clones. Furthermore, primers for synthesizing the 
full-length cDNA have been provided to clarify the function of the 
protein encoded by the cDNA. The full-length cDNA sequences s of 
the present invention contg. the translation start site provides 
information useful for analyzing the functions of the proteins. 
Tissue- and cell-specific expression patterns are also provided. 
[This abstr. record is one of 6 records for this patent necessitated 
by the large no. of index entries required to fully index the 
document and publication system constraints.]. 
IT 326928-35-2, Protein (human clone PLACE1004777 ) 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; human expressed sequence tags and primers 
for synthesizing full-length cDNAs) 
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INVENTOR (S) : 

PATENT ASSIGNEE (S ) : 
SOURCE: 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY, ACC. NUM. COUNT: 
PATENT INFORMATION: 



ligand detection 

Matsudaira, Paul T.; Eh 

Qiuhui; Freyson, Yelena 

Whitehead Institute for 

PCT Int. Appl., 4 4 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



lien, Daniel J.; Zhong, 
Biomedical Research, USA 



PATENT NO. KIND DATE APPLICATION NO. DATE 



WO 2001009177 A2 20010208 WO 2000-US20619 20000728 

W: CA, JP 

RW: AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE 

PRIORITY APPLN. INFO.: US 1999-146438P P 19990729 

AB The present invention is related to an affinity fluorescent protein 
(aFP) comprising a modified fluorescent protein or mol . which 
comprises a heterologous amino acid sequence, thereby introducing a 
ligand-activated protein binding site. The modified fluorescent 
protein displays an altered spectral property when the binding site 
is engaged with ligand relative to the spectral property displayed 
when the binding site is not engaged by ligand. The hexapeptide 
Leu-Glu-Pro-Arg-Ala-Ser which contains 3 restriction enzyme sites 
(Xhol-Avrl-Nhel) is useful for identifying fluorescent insensitive 
sites in the green fluorescent protein (GFP) . An epitope from 
hemagglutinin (HA tag comprising Tyr-Pro-Tyr-Asp-Val-Pro-Asp-Tyr- 
Ala) that is recognized by the monoclonal antibody 12CA5 is inserted 
into between residues Glnl57-Lysl58 and/or Glul72-Aspl73 and/or at 
the C-terminus of GFP; a Ser-147-Pro substitution is introduced into 
GFP for improved stability. The present invention also relates to 
an aFP expression cassette comprising a modified fluorescent protein 
nucleic acid sequence operatively linked to expression control 
sequences, wherein the modified fluorescent protein sequence 
comprises a recombinant peptide which comprises restriction 
endonuclease sites. The present invention also relates to a method 
of detecting the presence of a target ligand in a mixt. of 
macromols. Also encompassed by the present invention is a method of 
detecting the occurrence of a target ligand in a cell (e.g., a 
macrophage, a yeast cell) . 
IT 324745-26-8 

RL: BUU (Biological use, unclassified); BIOL (Biological study); 
USES (Uses) 

(affinity ligand; affinity fluorescent proteins and uses for 
ligand detection) 
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Cloning and structural characterization of 
ECTACC, a new member of the transforming acidic 
coiled coil (TACC) gene family: cDNA sequence 
and expression analysis in human microvascular 
endothelial cells 

Pu, Jeffrey J.; Li, Chaoyang; Rodriguez, 
Marilis; Banerjee, Debendranath 
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CORPORATE SOURCE: Department of Membrane Biochemistry II, The 

Lindsley F. Kimball Research Institute, New York 
Blood Center, New York, NY, 10021, USA 

SOURCE: Cytokine (2001), 13(3), 129-137 

CODEN: CYTIE9; ISSN: 1043-4666 

PUBLISHER: Academic Press 

DOCUMENT TYPE: Journal 

LANGUAGE: English 

AB Erythropoietin (Epo) transduces mitogenic and chemoattractant 
signals to human endothelial cells. Identifications of 
Epo-responsive genes are important for understanding the mol. nature 
of Epo signaling in endothelial cells. The effects of Epo on 
differential expression of various genes were examd. in human 
microvascular endothelial cells (HMVEC) by differential display 
reverse transcriptase polymerase chain reaction (RT-PCR) . In the 
current study we obtained from Epo— treated HMVEC a cDNA fragment 
with characteristics of the 3' end of mRNA. Using the cDNA 
fragment, we then selectively isolated a full-length clone by 
screening an unamplified endothelial cell cDNA library followed by 
5 ! rapid amplification of cDNA ends by polymerase chain reaction 
(RACE-PCR) . The nucleotide sequence of the longest cDNA revealed an 
open reading frame of 3311 nucleotides that encodes a protein 
consisting of .apprx.906 amino acids with a predicted MW of 
.apprx.100 kDa. The nucleotide sequence of the cDNA is nearly 
identical to that of transforming acidic coiled coil-contg. (TACC2) 
and anti-zuai-1 (AZU-1) cDNA clones except at the 5'- and 3 1 -ends . 
Northern blot anal, showed an increase in endothelial-TACC-related 
mRNA levels in Epo-treated cells in comparison to that of the 
control cells. Endothelial-TACC-related mRNA was highly expressed 
in heart and skeletal muscle tissue. Placenta and brain tissue 
exhibited low levels of expression of endothelial-TACC-related gene. 
Southern blot anal, of genomic DNA from somatic cell hybrids showed 
that endothelial-TACC-related cDNA maps to chromosome 10. 
Immunofluorescence microscopy and the occurrence of several putative 
phosphorylation and SH3 binding sites on the deduced protein suggest 
that endothelial-TACC-related protein may be involved in Epo 
signaling cascades in endothelial cells. (c) 2001 Academic Press. 

IT 355157-34-5, Protein ECTACC (human) 

RL: BPR (Biological process); BSU (Biological study, unclassified); 
PRP (Properties); BIOL (Biological study); PROC (Process) 

(amino acid sequence; cloning and structural characterization of 
ECTACC of human, a new member of the transforming acidic coiled 
coil (TACC) gene family) 
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TITLE: Novel mammalian calcium channels and related 

probes, cell lines and methods 
INVENTOR (S ) : Snutch, Terrance P.; Baillie, David L. 

PATENT ASSIGNEE (S): Neuromed Technologies, Inc., Can. 

SOURCE: PCT Int. Appl . , 103 pp. 

CODEN: PIXXD2 
DOCUMENT TYPE: Patent 
LANGUAGE: English 
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FAMILY ACC. NUM. COUNT: 1 
PATENT INFORMATION: 



PATENT NO. 
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WO 2000-CA794 



20000704 



WO 2001002561 A2 20010111 

WO 2001002561 A3 20010628 

W: AU, CA, IL, JP, NO, US 

RW: AT, BE, CH, CY, DE,' DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, 
NL, PT, SE 

EP 1190053 A2 20020327 EP 2000-945479 20000704 

R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, 
PT, IE, FI 

PRIORITY APPLN. INFO.: US 1999-346794 A 19990702 

WO 2000-CA794 W 20000704 

AB Sequences and partial sequences for three types of mammalian (human 
and rat sequences identified) T-type calcium channel subunits which 
we have labeled as the .alpha. 1G, .alpha. 1H and .alpha. II subunits 
are provided. Knowledge of the sequence of these calcium channels 
permits the localization and recovery of the complete sequence from 
human cells, and the development of cell lines which express the 
novel calcium channels of the invention. These cells may be used 
for identifying compds . capable of acting as agonists or antagonists 
to the calcium channels. 

IT 319502-44-8 

RL: PRP (Properties) 

(unclaimed protein sequence; novel mammalian calcium channels and 
related probes, cell lines and methods) 
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HCAPLUS COPYRIGHT 2002 ACS 
2001:12616 HCAPLUS 
134:82484 

Fanconi anemia protein interacting proteins 
FANCIP2 and FANCIP3 and cDNAs and methods for 
diagnosis and treatment of diseases 
Gross, Hans Joachim; Reuter, Tanja; Hanenberg, 
Helmut; Herterich, Sabine; Wagner, Matthias 
Multigene Biotech Gmbh, Germany 



PCT Int. Appl. 

CODEN : . PIXXD2 

Patent 

German 

1 



60 pp. 



PATENT NO. 



KIND DATE 



APPLICATION NO. 



DATE 



WO 2001000822 A2 

WO 2001000822 A3 

W: AE, AL, AM, AU, 

DM, EE, GD, GE, 

LC, LK, LR, LT, 

RU, SG, SI, SK, 

RW: GH, GM, KE, LS, 

CY, DE, DK, ES, 

BF, BJ, CF, CG, 

DE 19929887 Al 

EP 1194547 A2 



20010104 

20010705 
AZ, BA, BB, 
HR, HU, ID, 
LV, MA, MD, 
TJ, TM, TR, 
MW, MZ, SD, 
FI, FR, GB, 
CI, CM, GA, 

20010111 

20020410 



WO 2000-EP5878 20000626 

BG, BR, BY, CA, CN, CR, CU, CZ, 

IL, IN, IS, JP, KG, KP, KR, KZ, 

MG, MK, MN, MX, NO, NZ, PL, RO, 

TT, TZ, UA, US, UZ, VN, YU, ZA 

SL, SZ, TZ, UG, ZW, AT, BE, CH, 

GR, IE, IT, LU, MC, NL, PT, SE, 

GN, GW, ML, MR, NE, SN, TD, TG 
DE 1999-19929887 19990629 
EP 2000-943885 20000626 
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R: AT, BE, CH, DE, DK, ES, FR, GB, GR, IT, LI, LU, NL, SE, MC, 
PT, IE, SI, LT, LV, FI, RO 
PRIORITY APPLN. INFO.: DE 1999-19929887 A 19990629 

WO 2000-EP5878 W 20000626 
AB The invention relates to cDNA sequences encoding proteins FANCIP2 

and FANCIP3, which interact with the Fanconi anemia complementation 
group A { FANCA) protein, and the corresponding encoded proteins. 
The invention also relates to antibodies directed to said proteins, 
to FANCIP2- or FANCIP3-transgenic organisms and cells, and to the 
use of FANCIP2 and FANCIP3 for effector screening, and to the 
pharmaceutical application of the inventive nucleic acids, proteins 
and antibodies. 
IT 316927-56-7 

RL: PRP (Properties) 

(unclaimed protein sequence; fanconi anemia protein interacting 
proteins FANCIP2 and FANCIP3 and cDNAs and methods for diagnosis 
and treatment of diseases) 
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DOCUMENT NUMBER: 133:130477 

TITLE: The genome sequence of the plant pathogen 

Xylella fastidiosa 

AUTHOR(S): Simpson, A. J. G.; Reinach, F. C; Arruda, P.; 

Abreu, F. A.; Acencio, M.; Alvarenga, R.; Alves, 
L. M. C; Araya, J. E.; Bala, G. S.; Baptista, 

C. S.; Barros, M. H.; Bonaccorsi, E . D.; Bordin, 
S.; Bove, J. M.; Briones, M. R. S.; Bueno, M. R. 
P.; Camargo, A. A.; Camargo, L. E. A.; Carraro, 

D. M.; Carrer, H.; Colauto, N. B.; Colombo, C; 
Costa, F. F.; Costa, M. C. R. ; Costa-Neto, C. 
M.; Coutinho, L. L. ; Cristofani, M. ; Dias-Neto, 

E. ; Docena, C; El-Dorry, H.; Facincani, A. P.; 
Ferreira, A. J. S.; Ferreira, V. C. A.; Ferro, 
J. A.; Fraga, J. S.; Franca, S. C; Franco, M. 
C; Frohme, M. ; Furtan, L. R. ; Garnier, M. ; 
Goldman, G. H.; Goldman, M. H. S.; Gomes, S. L.; 
Gruber, A.; Ho, P. L.; Hoheisel, J. D.; 
Junqueira, M. L. ; Kemper, E. L.; Kitajima, J. 
P.; Kreiger, J. E . ; Duramae, E. E.; Laigret, F. ; 
Lambals, M. R. ; Lette, L. C. C; Lemos, E. G. 
M.; Lemos, M. V. F. ; Lopes, S. A.; Lopes, C. R. ; 
Machado, J. A.; Machado, M. A.; Madeira, A. M. 
B. N . ; Madeira, H. M. F. ; Marino, C. L.; 
Marques, M. V.; Martins, E. A. L.; Martins, E. 
M. F.; Matsukuma, A. Y.; Menck, C. F. M.; 
Miracca, E. C; Miyaki, C. Y. ; 

Monteiro-Vitorello, C. B.; Moon, D. H.; Nagai, 
M. A.; Nascimento, A. L. T. 0.; Netto, L. E. S . ; 
Nhanl, A., Jr.; Nobrega, F. G. ; Nunes, L. R. ; 
Oliveira, M. A.; de Oilveria, M. C; de 
Oliveira, R. C; Palmieri, D. A.; Paris, A.; 
Peixoto, B. R.; Pereira, G. A. G.; Perelra, H. 
A.; Pesquero, J. B . ; Quaggio, R. B. ; Roberto, P. 
G.; Rodrigues, V.; Rosa, A. J. de M.; de Rosa, 
V. E . , Jr.; de Sa, R. G.; Santelli, R. V.; 
Sawasaki, H. E.; da Silva, A. C. R. ; da Silva, 
A. M.; da Silva, F. R. ; Silva, W. A., Jr.; da 
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CORPORATE SOURCE: 
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PUBLISHER: 
DOCUMENT TYPE: 
LANGUAGE : 
AB 



IT 



Silveira, J. F. ; Silvestri, M. L. Z.; Siqueira, 
W. J.; de Souza, A. A.; de Souza, A. P.; 
Terenzi, M. F. ; Truffi, D. ; Tsai, S. M.; 
Tsuhako, M. H.; Vallada, H . ; Van Sluys, M. A.; 
Ver jovski-Almeida, S.; Vettore, A. L . ; Zago, M. 
A.; Zatz, M . ; Meidanis, J.; Setubal, J. C. 
Instituo Ludwig de Pesquisa sobre o Cancer, Sao 
Paulo, 01509-010, Brazil 

Nature (London) (2000), 406(6792), 151-157 
CODEN: NATUAS; ISSN: 0028-0836 
Nature Publishing Group 
Journal 
English 

Xylella fastidiosa is a fastidious, xylem-limited bacterium that 
causes a range of economically important plant diseases. The 
complete genome sequence of X. fastidiosa clone 9a5c, which causes 
citrus variegated chlorosis-a serious disease of orange trees, is 
reported. The genome comprises a 52.7% GC-rich 2 , 67 9, 305-base-pair 
(bp) circular chromosome and two plasmids of 51,158 bp and 1,285 bp. 
Putative functions can be assigned to 47% of the 2904 predicted 
coding regions. Efficient metabolic functions are predicted, with 
sugars as the principal energy and carbon source, supporting 
existence in the nutrient-poor xylem sap. The mechanisms assocd. 
with pathogenicity and virulence involve toxins, antibiotics and ion 
sequestration systems, as well as bacterium-bacterium and 
bacterium-host interactions mediated by a range of proteins. 
Orthologs of some of these proteins have only been identified in 
animal and human pathogens; their presence in X. fastidiosa 
indicates that the mol. basis for bacterial pathogenicity is both 
conserved and independent of host. At least 83 genes are 
bacteriophage-derived and include virulence-assocd. genes from other 
bacteria, providing direct evidence of phage-mediated horizontal 
gene transfer. 

284706-28-1, Protein (Xylella fastidiosa gene XF1737) 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 

(Biological study) 

(amino acid sequence; genome sequence of the plant pathogen 

Xylella fastidiosa) 
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The genome sequence of Drosophila melanogaster 
Adams, Mark D.; Celniker, Susan E.; Holt, Robert 
A.; Evans, Cheryl A.; Gocayne, Jeannine D.; 
Amanatides, Peter G.; Scherer, Steven E . ; Li, 
Peter W.; Hoskins, Roger A.; Galle, Richard F. ; 
George, Reed A.; Lewis, Suzanna E. ; Richards, 
Stephen; Ashburner, Michael; Henderson, Scott 
N.; Sutton, Granger G.; Wortman, Jennifer R.; 
Yandell, Mark D.; Zhang, Qing; Chen, Lin X.; 
Brandon, Rhonda C; Rogers, Yu-Hui C; Blaze j, 
Robert G.; Champe, Mark; Pfeiffer, Barret D.; 
Wan, Kenneth H.; Doyle, Clare; Baxter, Evan G. ; 
Helt, Gregg; Nelson, Catherine R.; Miklos, 
George L. Gabor; Abril, Josep F. ; Agbayani, 
Anna; An, Hui-Jin; Andrews-Pf annkoch, Cynthia; 
Baldwin, Danita; Ballew, Richard M.; Basu, 
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Anand; Baxendale, James; Bayraktaroglu, Leyla; 
Beasley, Ellen M. ; Beeson, Karen Y . ; Benos, P. 
V.; Berman, Benjamin P.; Bhandari, Deepali; 
Bolshakov, Slava; Borkova, Dana; Botchan, 
Michael R.; Bouck, John; Brokstein, Peter; 
Brottier, Phillipe; Burtis, Kenneth C; Busam, 
Dana A. ; Butler, Heather; Cadieu, Edouard; 
Center, Angela; Chandra, Ishwar; Cherry, J. 
Michael; Cawley, Simon; Dahlke, Carl; Davenport, 
Lionel B.; Davies, Peter; De Pablos, Beatriz; 
Delcher, Arthur; Deng, Zuoming; Mays, Anne 
Deslattes; Dew, Ian; Dietz, Suzanne M. ; Dodson, 
Kristina; Doup, Lisa E.; Downes, Michael; 
Dugan-Rocha, Shannon; Dunkov, Boris C; Dunn, 
Patrick; Durbin, Kenneth J.; Evangelista, Carlos 
C; Ferraz, Concepcion; Ferriera, Steven; 
Fleischmann, Wolfgang; Foster, Carl; Gabrielian, 
Andrei E . ; Garg, Neha S.; Gelbart, William M.; 
Glasser, Ken; Glodek, Anna; Gong, Fangcheng; 
Gorrell, J. Harley; Gu, Zhiping; Guan, Ping; 
Harris, Michael; Harris, Nomi L.; Harvey, Damon; 
Heiman, Thomas J. ; Hernandez, Judith R. ; Houck, 
Jarrett; Hostin, Damon; Houston, Kathryn A.; 
Howland, Timothy J.; Wei, Ming-Hui; Ibegwam, 
Chinyere; Jalali, Mena; Kalush, Francis; Karpen, 
Gary H.; Ke, Zhaoxi; Kennison, James A,; 
Ketchum, Karen A.; Kimmel, Bruce E.; Kodira, 
Chinnappa D.; Kraft, Cheryl; Kravitz, Saul; 
Kulp, David; Lai, Zhongwu; Lasko, Paul; Lei, 
Yiding; Levitsky, Alexander A.; Li, Jiayin; Li, 
Zhenya; Liang, Yong; Lin, Xiaoying; Liu, 
Xiangjun; Mattei, Bettina; Mcintosh, Tina C; 
McLeod, Michael P.; McPherson, Duncan; Merkulov, 
Gennady; Milshina, Natalia V.; Mobarry, Clark; 
Morris, Joe; Moshrefi, Ali; Mount, Stephen M.; 
Moy, Mee; Murphy, Brian; Murphy, Lee; Muzny, 
Donna M. ; Nelson, David L.; Nelson, David R. ; 
Nelson, Keith A.; Nixon, Katherine; Nusskern, 
Deborah R. ; Pacleb, Joanne M.; Palazzolo, 
Michael; Pittman, Gjange S.; Pan, Sue; Pollard, 
John; Puri, Vinita; Reese, Martin G. ; Reinert, 
Knut; Remington, Karin; Saunders, Robert D. C. ; 
Scheeler, Frederick; Shen, Hua; Shue, Bixiang 
Christopher; Siden-Kiamos, Inga; Simpson, 
Michael; Skupski, Marian P.; Smith, Tom; Spier, 
Eugene; Spradling, Allan C; Stapleton, Mark; 
Strong, Renee; Sun, Eric; Svirskas, Robert; 
Tector, Cyndee; Turner, Russell; Venter, Eli; 
Wang, Aihui H.; Wang, Xin; Wang, Zhen-Yuan; 
Wassarman, David A.; Weinstock, George M.; 
Weissenbach, Jean; Williams, Sherita M. ; 
Woodage, Trevor; Worley, Kim C; Wu, David; 
Yang, Song; Yao, Q. Alison; Ye, Jane; Yeh, 
Ru-Fang; Zaveri, Jayshree S.; Zhan, Ming; Zhang, 
Guangren; Zhao, Qi; Zheng, Liansheng; Zheng, 
Xiangqun H . ; Zhong, Fei N. ; Zhong, Wenyan; Zhou, 
Xiaojun; Zhu, Shiaoping; Zhu, Xiaohong; Smith, 
Hamilton 0.; Gibbs, Richard A.; Myers, Eugene 



Searcher : Shears 308-4994 



09/627383 



W.; Rubin, Gerald M. ; Venter, J. Craig 
CORPORATE SOURCE: Celera Genomics, Rockville, MD, 20850, USA 

SOURCE: Science (Washington, D. C.) (2000), 287(5461), 

2185-2195 

CODEN: SCIEAS; ISSN: 0036-8075 
PUBLISHER: American Association for the Advancement of 

Science 

DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB The fly Drosophila melanogaster is one of the most intensively 
studied organisms in biol. and serves as a model system for the 
investigation of many developmental and cellular processes common to 
higher eukaryotes, including humans. The nucleotide sequence was 
detd. of nearly all of the . apprx. 120-megabase euchromatic portion 
of the Drosophila genome using a whole-genome shotgun sequencing 
strategy supported by extensive clone-based sequence and a 
high-quality bacterial artificial chromosome phys . map. Efforts are 
under way to close the remaining gaps; however, the sequence is of 
sufficient accuracy and contiguity to be declared substantially 
complete and to support an initial anal, of genome structure and 
preliminary gene annotation and interpretation. The genome encodes 
.apprx. 13, 600 genes, somewhat fewer than the smaller Caenorhabditis 
elegans genome, but with comparable functional diversity. Access to 
supporting information on each gene is available through FlyBase at 
http://flybase.bio.indiana.edu and through Celera at www.celera.com; 
the sequences are deposited in GenBank with Accession Nos. 
AE002566-AE003403. [This abstr. record is one of 4 records for this 
document necessitated by the large no. of index entries required to 
fully index the document and publication system restraints.]. 

IT 263525-23-1 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; genome sequence of Drosophila melanogaster) 
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Adams, Mark D.; Celniker, Susan E.; Holt, Robert 
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Amanatides, Peter G.; Scherer, Steven E.; Li, 
Peter W.; Hoskins, Roger A,; Galle, Richard F. ; 
George, Reed A.; Lewis, Suzanna E.; Richards, 
Stephen; Ashburner, Michael; Henderson, Scott 
N.; Sutton, Granger G.; Wortman, Jennifer R.; 
Yandell, Mark D. ; Zhang, Qing; Chen, Lin X.; 
Brandon, Rhonda C; Rogers, Yu-Hui C. ; Blazej, 
Robert G,; Champe, Mark; Pfeiffer, Barret D.; 
Wan, Kenneth H.; Doyle, Clare;' Baxter, Evan G.; 
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Baldwin, Danita; Ballew, Richard M.; Basu, 
Anand; Baxendale, James; Bayraktaroglu, Leyla; 
Beasley, Ellen M.; Beeson, Karen Y.; Benos, P. 
V.; Berman, . Benjamin P.; Bhandari, Deepali; 
Bolshakov, Slava; Borkova, Dana; Botchan, 
Michael R. ; Bouck, John; Brokstein, Peter; 
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Brottier, Phillipe; Burtis, Kenneth C; Busam, 
Dana A.; Butler, Heather; Cadieu, Edouard; 
Center, Angela; Chandra, Ishwar; Cherry, J. 
Michael; Cawley, Simon; Dahlke, Carl; Davenport, 
Lionel B.; Davies, Peter; De Pablos, Beatriz; 
Delcher, Arthur; Deng, Zuoming; Mays, Anne 
Deslattes; Dew, Ian; Dietz, Suzanne M . ; Dodson, 
Kristina; Doup, Lisa E.; Downes, Michael; 
Dugan-Rocha, Shannon; Dunkov, Boris C; Dunn, 
Patrick; Durbin, Kenneth J.; Evangelista, Carlos 
C; Ferraz, Concepcion; Ferriera, Steven; 
Fleischmann, Wolfgang; Foster, Carl; Gabrielian, 
Andrei E.; Garg, Neha S.; Gelbart, William M.; 
Glasser, Ken; Glodek, Anna; Gong, Fangcheng; 
Gorrell, J. Harley; Gu, Zhiping; Guan, Ping; 
Harris, Michael; Harris, Nomi L.; Harvey, Damon; 
Heiman, Thomas J.; Hernandez, Judith R. ; Houck, 
Jarrett; Hostin, Damon; Houston, Kathryn A.; 
Howland, Timothy J.; Wei, Ming-Hui; Ibegwam, 
Chinyere; Jalali, Mena; Kalush, Francis; Karpen, 
Gary H.; Ke, Zhaoxi; Kennison, James A. ; 
Ketchum, Karen A.; Kimmel, Bruce E.; Kodira, 
Chinnappa D.; Kraft, Cheryl; Kravitz, Saul; 
Kulp, David; Lai, Zhongwu; Lasko, Paul; Lei, 
Yiding; Levitsky, Alexander A.; Li, Jiayin; Li, 
Zhenya; Liang, Yong; Lin, Xiaoying; Liu, 
Xiangjun; Mattei, Bettina; Mcintosh, Tina C; 
McLleod, Michael P.; McPherson, Duncan; 
Merkulov, Gennady; Milshina, Natalia V. ; 
Mobarry, Clark; Morris, Joe; Moshrefi, Ali; 
Mount, Stephen M. ; Moy, Mee; Murphy, Brian; 
Murphy, Lee; Muzny, Donna M. ; Nelson, David L.; 
Nelson, David R.; Nelson, Keith A.; Nixon, 
Katherine; Nusskern, Deborah R.; Pacleb, Joanne 
M.; Palazzolo, Michael; Pittman, Gjange S.; Pan, 
Sue; Pollard, John; Puri, Vinita; Reese, Martin 
G.; Reinert, Knut; Remington, Karin; Saunders, 
Robert D. C; Scheeler, Frederick; Shen, Hua; 
Shue, Bixiang Christopher; Siden-Kiamos , Inga; 
Simpson, Michael; Skupski, Marian P.; Smith, 
Tom; Spier, Eugene; Spradling, Allan C. ; 
Stapleton, Mark; Strong, Renee; Sun, Eric; 
Svirskas, Robert; Tector, Cyndee; Turner, 
Russell; Venter, Eli; Wang, Aihui H.; Wang, Xin; 
Wang, Zhen-Yuan; Wassarman, David A. ; Weinstock, 
George M . ; Weissenbach, Jean; Williams, Sherita 
M . ; Woodage, Trevor; Worley, Kim C; Wu, David; 
Yang, Song; Yao, Q. Alison; Ye, Jane; Yeh, 
Ru-Fang; Zaveri, Jayshree S.; Zhan, Ming; Zhang, 
Guangren; Zhao, Qi; Zheng, Liansheng; Zheng, 
Xiangqun H . ; Zhong, Fei N. ; Zhong, Wenyan; Zhou, 
Xiaojun; Zhu, Shiaoping; Zhu, Xiaohong; Smith, 
Hamilton 0.; Gibbs, Richard A.; Myers, Eugene 
.W.; Rubin, Gerald M. ; Venter, J. Craig 
Celera Genomics, Rockville, MD, 20850, USA 
Science (Washington, D. C.) (2000), 287(5461), 
2185-2195 

CODEN: SCIEAS; ISSN: 0036-8075 
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PUBLISHER: American Association for the Advancement of 

Science 

DOCUMENT .TYPE: ' Journal 
LANGUAGE: English 

AB The fly Drosophila melanogaster is one of the most intensively 
studied organisms in biol . and serves as a model system for the 
investigation of many developmental and cellular processes common to 
higher eukaryotes, including humans. The nucleotide sequence was 
detd. of nearly all of the . apprx . 120-megabase euchromatic portion 
of the Drosophila genome using a whole-genome shotgun sequencing 
strategy supported by extensive clone-based sequence and a 
high-quality bacterial artificial chromosome phys . map. Efforts are 
under way to close the remaining gaps; however, the sequence is of 
sufficient accuracy and contiguity to be declared substantially 
complete and to support an initial anal, of genome structure and 
preliminary gene annotation and interpretation. The genome encodes 
. apprx. 13, 600 genes, somewhat fewer than the smaller Caenorhabditis 
elegans genome, but with comparable functional diversity. Access to 
supporting information on each gene is available through FlyBase at 
http://flybase.bio.indiana.edu and through Celera at www.celera.com; 
the sequences are deposited in GenBank with Accession Nos . 
AE002566-AE003403. [This abstr. record is one of 4 records for this 
document necessitated by the large no. of index entries required to 
fully index the document and publication system constraints.]. 
262988-39-6 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; genome sequence of Drosophila melanogaster) 
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132:204089 

Protein and cDNA sequences encoding Neisseria 
meningitidis NMASP protein, and uses thereof in 
treating meningitis 

Jackson, W. James; Harris, Andrea M. 

Antex Biologies Inc., USA 

PCT Int. Appl., 7 6 pp. 
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Patent 

English 
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DK, ES, 
CF, CG, 

US 2002018782 

AU 9957894 

EP 1109454 

R: AT, BE, 
PT, IE, 

PRIORITY APPLN. INFO 



FI, FR, GB, GR, 
CI, CM, GA, GN, 
Al 20020214 
Al 20000321 
A2 20010627 
CH, DE, DK, ES, 
SI, LT, LV, FI, 



IE, 
GW, 



FR, 
RO 



IT, LU, MC, NL, PT, SE, BF, BJ, 

ML, MR, NE, SN, TD, TG 

US 1999-388089 19990831 
AU 1999-57894 19990901 
EP 1999-945257 19990901 

GB, GR, IT, LI, LU, NL, SE, MC, 



US 1998-98685P P 19980901 
WO 1999-US19663 W 19990901 
AB The invention discloses the Neisseria meningitidis NMASP protein and 
cDNA sequences, derivs. thereof (NMASP-derived polypeptides), and 
antibodies that specifically bind the NMASP protein and/or 
NMASP-derived polypeptides. The NMASP protein of the invention has 
limited similarity (36% sequence identity) to the DegP (HtrA) 
protein of E. coli and has not been previously identified in any N. 
meningitidis. Also disclosed are prophylactic or therapeutic 
compns., including immunogenic compns . like vaccines, comprising 
NMASP protein and/or a NMASP-derived polypeptide. The invention is 
particularly directed toward compns. for treating/preventing 
meningitis. The invention addnl . discloses methods of inducing an 
immune response to N. meningitidis and N. meningitidis NMASP protein 
and/or a NMASP-derived polypeptide in animals. 
IT 260386-80-9P, Protein NMASP (Neisseria meningitidis) 

RL: BOC (Biological occurrence); BPN (Biosynthetic preparation); BSU 
(Biological study, unclassified); PRP (Properties); THU (Therapeutic 
use); BIOL (Biological study); OCCU (Occurrence); PREP 
(Preparation); USES (Uses) 

(amino acid sequence; protein and cDNA sequences encoding 
Neisseria meningitidis NMASP protein, and uses thereof in 
treating and diagnosing meningitis) 

L2 ANSWER 17 OF 23 HCAPLUS COPYRIGHT 2002 ACS 



ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



INVENTOR (S) : 
PATENT ASSIGNEE (S) : 
SOURCE.: 

DOCUMENT TYPE: 
LANGUAGE : 

FAMILY ACC. NUM. COUNT: 
PATENT INFORMATION: 



2000: 15224 HCAPLUS 
132:74540 

Protein and cDNA sequences of human tumor 
suppressor proteins encoded by AZ-1 and AZ-2 
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PCT Int. Appl., 120 pp. 
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CF, CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG 
AU 9948347 Al 20000117 AU 1999-48347 19990625 

PRIORITY APPLN. INFO. : US 1998-90747P P 19980626 

WO 1999-US14482 W 19990625 
The invention provides protein and cDNA sequences of human tumor 
suppressor proteins encoded by AZ-1 and AZ-2 genes. Preferably, the 
invention relates to the tumor suppressor encoded by the AZ-1 gene 
and to the detection of its level in breast cells as a marker of 
malignancy progression and/or tumorigenic reversion. Thus, the 
invention also relates to the diagnosis, prevention, and/or 
treatment of breast cancer. The invention also concerns monoclonal 
or polyclonal antibodies specific to AZ-1, AZ-2 encoded protein and 
to AZ-1, or AZ-2 encoded protein homologs . 
253582-33-1P 

RL: BAC (Biological activity or effector, except adverse); BOC 
(Biological occurrence) ; BPN (Biosynthetic preparation) ; BSU 
(Biological study, unclassified); PRP (Properties); THU (Therapeutic 
use); BIOL (Biological study); OCCU (Occurrence); PREP 
(Preparation); USES (Uses) 

(amino acid sequence; protein and cDNA sequences of human tumor 
suppressor proteins encoded by AZ-1 and AZ-2 genes, and uses 
thereof in the diagnosis, prevention, and/or treatment of breast 
cancer) 
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Genome sequence of the radioresistant bacterium 
Deinococcus radiodurans Rl 

White, Owen; Eisen, Jonathan A. ; Heidelberg, 
John F.; Hickey, Erin K. ; Peterson, Jeremy D.; 
Dodson, Robert J.; Haft, Daniel H.; Gwinn, 
Michelle L.; Nelson, William C; Richardson, 
Delwood L.; Moffat, Kelly S.; Qin, Haiying; 
Jiang, Lingxia; Pamphile, Wanda; Crosby, Marie; 
Shen, Mian; Vamathevan, Jessica J.; Lam, Peter; 
McDonald, Lisa; Utterback, Terry; Zalewski, 
Celeste; Makarova, Kira S.; Aravind, L.; Daly, 
Michael J.; Minton, Kenneth W. ; Fleischmann, 
Robert D.; Ketchum, Karen A.; Nelson, Karen E.; 
Salzberg, Steven; Smith, Hamilton 0.; Venter, J. 
Craig; Fraser, Claire M. 

The Institute for Genomic Research, Rockville, 
MD, 20850, USA 

Science (Washington, D. C.) (1999), 286(5444), 
1571-1577 

CODEN: SCIEAS; ISSN: 0036-8075 
American Association for the Advancement of 
Science 
Journal 
English 

genome sequence of the radiation-resistant bacterium 
adiodurans Rl is composed of two chromosomes (2, 648,63£ 
ase pairs), a megaplasmid (177,466 base pairs), and a 
(45,704 base pairs), yielding a total genome of 
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3/284,156 base pairs. Multiple components distributed on the 
chromosomes and megaplasmid that contribute to the ability of D. 
radiodurans to survive under conditions of starvation, oxidative 
stress, and high amts. of DNA damage were identified. Deinococcus 
radiodurans represents an organism in which all systems for DNA 
repair, DNA damage export, desiccation and starvation recovery, and 
genetic redundancy are present in one cell. 
IT 250313-43-0 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; genome sequence of the radioresistant 
bacterium Deinococcus radiodurans Rl) 
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DOCUMENT NUMBER: 131:253834 

TITLE: The Cloning and Developmental Expression of 

Unconventional Myosin IXA (MY09A) a Gene in the 
Bardet-Biedl Syndrome (BBS4) Region at 
Chromosome 15q22-q23 

Gorman, Susan W. ; Haider, Neena B.; Grieshammer, 
Uta; Swiderski, Ruth E.; Kim, Esther; Welch, 
Juliet W.; Searby, Charles; Leng, Song; Carmi, 
Rivka; Sheffield, Val C; Duhl, David M. 
Chiron Corporation, Emeryville, CA, 94 608, USA 
Genomics (1999), 59(2), 150-160 
CODEN: GNMCEP; ISSN: 0888-7543 
Academic Press 
Journal 
English 

Bardet-Biedl Syndrome (BBS ) is a heterogeneous, autosomal recessive 
disorder characterized by mental retardation, obesity, retinitis 
pigmentosa, syndactyly and/or Polydactyly, short stature, and 
hypogenitalism and is caused by mutations at a no. of distinct loci. 
Using a positional cloning approach for identifying the BBS4 
(chromosome 15) gene, we identified and cloned an unconventional 
myosin gene, myosin IXA (HGMW-approved symbol MY09A) . Since 
mutations in unconventional myosins are known to cause several human 
diseases, and since mutations of unconventional myosin Vila cause 
retinal degeneration, we evaluated myosin IXA as. a candidate for 
BBS. We exploited PCR-based techniques to clone a 8473-nt cDNA for 
myosin IXA. A 7644-bp open reading frame predicts a protein with 
all the hallmarks of class IX unconventional myosins. Human 
Northern blot anal, and in situ hybridization of mouse embryos 
reveal that myosin IXA is expressed in many tissues consistent with 
BBS. Intron/exon boundaries were identified, and myosin IXA DNA and 
RNA from BBS4 patients were evaluated for mutation. (c) 1999 
Academic Press. 
244613-54-5 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; cloning, cDNA sequence, and mRNA expression 
of unconventional myosin IXA gene (MY09A) , a gene in the 
Bardet-Biedl Syndrome (BBS4) region at chromosome 15q22-q23) 
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HCAPLUS COPYRIGHT 2002 ACS 
1999:271499 HCAPLUS 
130:292458 

The amino acid and nucleic acid sequences of 
Myosin IXa and cyclic nucleotide gated channel 
and their uses in diagnosis and treatment of 
human diseases 

Gorman, Susan W. ; Welch, Julie; Duhl, David; 
Leng, Song; Adams, Arwen; Sheffield, Val; Chiu, 
Choi Ying 

Chiron Corporation, USA 
PCT Int. Appl., 69 pp. 
CODEN: PIXXD2 
Patent 
English 
1 



PATENT NO. 



KIND DATE 



APPLICATION NO. DATE 



WO 



AU 
US 



9919489 
W: AL, 
CZ, 
GM, 
LS, 
RU, 
UZ, 
RW: GH, 
ES, 
CG, 
9910986 
6300485 



AM, 
CZ, 
HR, 
LT, 
SD, 
VN, 
GM, 
FI, 
CI, 



Al 19990422 
AT, AT, AU, AZ, 



DE, 

HU, 

LU, 

SE, 

YU, 

KE, 

FR, 

CM, 
Al 
Bl 



DE, 
ID, 
LV, 
SG, 
ZW, 
LS, 
GB, 
GA, 



DK, 
IL, 
MD, 
SI, 
AM, 
MW, 
GR, 
GN, 



DK, 
IS, 
MG, 
SK, 
AZ, 
SD, 
IE, 
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BA, 
EE, 
JP, 
MK, 
SK, 
BY, 
SZ, 
IT, 
ML, 



WO 1998-US21971 19981014 
BB, BG, BR, BY, CA, CH, CN, 
FI, 
KR, 
NO, 
TR, 
RU, 
BE, 
PT, 
TD, 



EE, 
KE, 
MN, 
SL, 
KG, 
UG, 
LU, 
MR, 



ES, 
KG, 
MW, 
TJ, 
KZ, 
ZW, 
MC, 
NE, 



19990503 
20011009 



PRIORITY APPLN. INFO. 



US 
US 
US 
WO 



FI, 
KP, 
MX, 
TM, 
MD, 
AT, 
NL, 
SN, 

AU 1999-10986 
US 1998-172422 
1997-62858P P 
1997-62241P P 

1997- 68953P P 

1998- US21971 W 



GB, 
KZ, 
NZ, 
TT, 
TJ, 
CH, 
SE, 
TG 



GD, 

LC, 

PL, 

UA, 

TM 

CY, 

BF, 



GE, 
LK, 
PT, 
UG, 

DE, 
BJ, 



CU, 
GH, 
LR, 
RO, 
US, 

DK, 
CF, 



19981014 
19981014 
19971015 
19971017 
19971230 
19981014 



AB The amino acid and nucleic acid sequences of a new cyclic nucleotide 
gated channel-15 (CNGC-15) and Myosin IXa that map to the region of 
the human chromosome assocd. with Bardet-Biedl Syndrome are 
disclosed. CNGCs comprise a family of multimeric protein ion 
channels that open in response to the binding of a cyclic nucleotide 
to an intracellular domain. The two new proteins, CNGC-15 and 
Myosin IXa, are useful in the study, diagnosis and treatment of 
Bardet-Biedl Syndrome and Usher Syndrome. Other indications that 
can be treated by CNGC-15 and/or Myosin IXa polypeptides, or 
agonists or antagonists include hearing loss, retinitis pigmentosa, 
obesity, hypogonadism, sterility, Polydactyly, brachydactyly, 
syndactyly, mental retardation, renal abnormalities, hypertension, 
diabetes and cardiovascular abnormalities. Compns . and methods for 
expressing CNGC and Myosin IXa are provided. The compns. comprise 
CNGC-15 and Myosin IXa and polypeptides and derivs. thereof, 
nucleotide sequences, expression cassettes, transformed cells and 
antibodies to these polypeptides. Methods for the expression and 
detection of CNGC and Myosin IXa nucleotides and polypeptides and 
compns. for the treatment of these conditions are also provided. 
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IT 222964-43-4, Myosin IXa (human clone BAC) 

RL: BSU (Biological study, unclassified); PRP. (Properties ) ; THU 
(Therapeutic use); BIOL (Biological study); USES (Uses) 

(amino acid sequence; amino acid and nucleic acid sequences of 
Myosin IXa and cyclic nucleotide gated channel (CNGC) and uses in 
diagnosis and treatment of human diseases) 
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TITLE: Myr 7 is a novel myosin IX-RhoGAP expressed in 

rat brain 

AUTHOR (S) : Chieregatti, Evelina; Gartner, Annette; 

Stoffler, Hanns-Eugen; Bahler, Martin 
CORPORATE SOURCE: Friedrich-Miescher Laboratorium in the 

Max-Planck Society, Tubingen, 7207 6, Germany 
SOURCE: Journal of Cell Science (1998), 111(24), 

3597-3608 

CODEN: JNCSAI; ISSN: 0021-9533 
PUBLISHER: Company of Biologists Ltd. 

DOCUMENT TYPE: Journal 
LANGUAGE: English 

AB Rho family GTPases are important regulators of neuronal morphol . , 

but the proteins directly controlling their activity in neurons are 
still poorly defined. The authors report the identification of myr 
7, a novel unconventional myosin IX-RhoGAP expressed in rat brain. 
Myr 7 is a multidomain protein related to myr 5, the first class IX 
myosin to be characterized. It exhibits a myosin head domain with 
an N-terminal extension and a large insertion at loop 2, an actin 
contact site and regulator of myosin ATPase rate. The myosin head 
domain is followed by a neck domain consisting of six unevenly 
spaced consecutive IQ motifs representing light chain binding sites. 
The tail domain contains a C6H2-zinc binding motif and a region that 
specifically stimulates the GTPase-activity of Rho followed by a 
short stretch predicted to adopt a coiled-coil structure. Five 
alternatively spliced regions, one in the 5 ' -noncoding region, two 
in the myosin head and two in the tail domain, were noted. Anal, of 
myr 7 and myr 5 expression in different tissues revealed that myr 7 
is expressed at high levels in developing and adult brain tissue 
whereas myr 5 is expressed only at moderate levels in embryonic 
brain tissue and at even further reduced levels in adult brain 
tissue. Myr 5 is, however, highly expressed in lung, liver, spleen 
and testis. Myr 7 is expressed in all brain regions and is 
localized in the cytoplasm of cell bodies, dendrites and axons. Myr 
5 exhibits an overlapping, but not identical cellular distribution. 
Finally, a myr 7 fusion protein encompassing the GAP domain 
specifically activates the GTPase-activity of Rho in vitro, and 
overexpression of myr 7 in HtTAl-HeLa cells leads to inactivation of 
Rho in vivo. These results are compatible with a role for myr 7 
(and myr 5) in regulating Rho activity in neurons and hence in 
regulating neuronal morphol. and function. 

IT 221651-87-2 

RL: BOC (Biological occurrence); BPR (Biological process); BSU 
(Biological study, unclassified); PRP (Properties); BIOL (Biological 
study); OCCU (Occurrence); PROC (Process) 
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(amino acid sequence; cDNA sequence of human and rat myr 7, a 
novel myosin IX-rhoGAP expressed in rat brain) 
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1999:9860 HCAPLUS 
130: 77730 

sequence of human and mouse neuro-growth factor 
like protein Zneul and antibodies for detection 
strategies 

Sheppard, Paul O.; Jelinek, Laura J.; Whitmore, 

Theodore E.; Blumberg, Hal; Lehner, Joyce M. 

Zymogenetics, Inc., USA 

PCT Int. Appl., 70 pp. 

CODEN: PIXXD2 

Patent 

English 

1 



PATENT 


NO. 




KIND 


DATE 






APPLICATION NO. 


DATE 




WO 


9857983 




A2 




1998 


1223 




WC 


> 1998-US12763 


19980618 
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A3 




19990318 
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KE, 


LS, 


MW, 


SD, 


SZ, 


UG, 


ZW, 
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20010809 
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996628 




A2 




20000503 
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AT, 
PT, 


BE, 
IE, 


CH, 
FI 


DE, 


DK, 


ES, 
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GB, 


GR, 
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NL, SE, 
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T2 




20020226 
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US 1997-50143P P 19970618 
US 1997-878322 A 19970618 
WO 1998-US12763 W 19980618 
AB A novel mammalian neuro-growth factor like polypeptide Zneul, 

polynucleotides encoding the polypeptides, and related compns . and 
detection methods including antibodies and anti-idiotypic antibodies 
and humanized antibodies and antibody fragments single-chain 
antibodies are presented. An expression vector is described for 
effective Zneul protein expression in a eukaryotic cell. In addn., 
chimeric proteins involving Zneul are described. 
IT 218778-65-5 218778-68-8 

RL: BSU (Biological study, unclassified); PRP (Properties); BIOL 
(Biological study) 

(amino acid sequence; sequence of human and mouse neuro-growth 
factor like protein Zneul and antibodies for detection 
strategies) 
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ACCESSION NUMBER: 1996:319778 HCAPLUS 

DOCUMENT NUMBER: 125:82238 

TITLE: Distinct cellular and subcellular patterns of 

expression imply distinct functions for the 
Drosophila homologs of moesin and the 
neurofibromatosis 2 tumor suppressor, merlin 
McCartney, Brooke M. ; Fehon, Richard G. 
Dev., Cell., Molecular Biol. Group., Dep. 
Zoology, Duke Univ., Durham, NC, 27708-1000, USA 
J. Cell Biol. (1996), 133(4), 843-852 
CODEN: JCLBA3 ; ISSN: 0021-9525 
Journal 
English 

Interest in members of the protein 4.1 super-family, . which includes 
the ezrin-radixin-moesin (ERM) group, has been stimulated recently 
by the discovery that the human neurofibromatosis 2 (NF2) tumor 
suppressor gene encodes an ERM-like protein, merlin. Although many 
proteins in this family are thought to act by linking the 
actin-based cytoskeleton to transmembrane proteins, the cellular 
functions of merlin have not been defined. To investigate the 
cellular and developmental functions of these proteins, Drosophila 
homologs of moesin (Dmoesin) and the NF2 tumor suppressor merlin 
(Dmerlin) were identified and characterized. Specific antibodies 
were used to show that although these proteins are frequently 
coexpressed in developing tissues, they display distinct subcellular 
localizations. Whereas Dmoesin is obsd. in continuous assocn. with 
the plasma membrane, as a typical for an ERM family protein, Dmerlin 
is found in punctate structures at the membrane and in cytoplasm. 
Investigation of Dmerlin in cultured cells demonstrates that it is 
assocd. with endocytic compartments. As a result of these studies, 
the merlin protein is proposed to have unique functions in the cell 
which differ from those of other ERM family members. 
178535-96-1 

RL: BOC (Biological occurrence); PRP (Properties); BIOL (Biological 
study) ; OCCU (Occurrence) 

(amino acid sequence; distinct cellular and subcellular patterns 
of expression imply distinct functions for the Drosophila 
homologs of moesin and the neurofibromatosis 2 tumor suppressor 
merlin) 
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28 SEA FILE=REGISTRY ABB=ON PLU=ON (178535 

218778-65-5/BI OR 218778-68-8/BI OR 22165 

222964-43-4/BI OR 24 4 613-54-5/BI OR 25031 

253582-33-1/BI OR 260386-80-9/BI OR 26298 

263525-23-1/BI OR 284706-28-1/BI OR 31692 

319502-44-8/BI OR 324745-26-8/BI OR 32692 

331287-05-9/BI OR 332001-59-9/BI OR 33200 

342874-40-2/BI OR 352374-90-4 /BI OR 35237 

352374-92-6/BI OR 355157-34-5/BI OR 36965 

371995-57-2/BI OR 403520-7 1-8/BI OR 40354 



-96-1/BI OR 
1-87-2/BI OR 

3- 43-0/BI OR 
8-39-6/BI OR 

7- 56-7/BI OR 

8- 35-2/BI OR 
1-60-2/BI OR 

4- 91-5/BI OR 

9- 63-2/BI OR 
6-70-3/BI) 
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RN 403546-70-3 REGISTRY 

CN Protein (human clone WO0188088-SEQID-14753 fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN 2753: PN: WO0188088 SEQID: 14753 claimed protein 
CI MAN 
SQL 108 

SEQ 1 SHAGCLIRFW RKSMTPTHSL PLTPTFLGTC EASFLEPRAS PVPPQCSMAL 



51 RRYRLDMGQS FWGGLPSSHP PDPSRPGFVP GVGHVPGQEG PGGKPAPDSS 
101 XHXDPTGG 
HITS AT: 35-40 

REFERENCE 1: 136:227947 

L4 ANSWER 2 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 403520-71-8 REGISTRY 

CN Protein (human clone WO0188088-SEQID-12024 fragment) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN 24: PN: WO0188088 SEQID: 12024 claimed protein 
CI MAN 
SQL 75 

SEQ 1 GHTGPLGSPW SSVWVCLAGR QVPGPQHPHR PPGCSWGCRP PAGTGPRLPS 

51 ASAPRCCPPR MRLEPRASRR SGTSG 



HITS AT: 63-68 
REFERENCE 1: 136:227947 

L4 ANSWER 3 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 371995-57-2 REGISTRY 

CN Protein ( Propionibacterium acnes strain ATCC6919 clone 

WO0181581-SEQID-896 open reading frame) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 896: PN: WO0181581 SEQID: 896 claimed protein 
CI MAN 
SQL 125 

SEQ 1 RISQHVLARR GAHDRRNRCE AGLASSTPPT LTHNEFVAVV GWGHNYRLQN 

51 SDRPDRLREF GQFFLVKHFT RLARVRRDLI HGDELEPRAS NTFVYRVAVS 



101 TSVINALFVI IIEIRALKQL SETPP 
HITS AT: 85-90 

REFERENCE 1: 135:353807 

L4 ANSWER 4 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 369659-63-2 REGISTRY 

CN Protein (human clone WO0179449-SEQID-7642 fragment) (9CI) (CA INDEX 

NAME) 
OTHER NAMES: 

■CN 116: PN: WO0179449 SEQID: 2145 claimed sequence 
CI MAN 
SQL 119 
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SEQ 1 LTQLKTHCPL IKSKTMNKKR AIREPAQEPG PQKEENPKKH RSPSFTSTSF 

51 PGLEVPASYS PPTKAEQPGQ VRKAVQPAVR LEPRASHPAG PPVPPSGVLV 



101 SRRRPEPGQG KPPESDFDH 
HITS AT: 81-86 

REFERENCE 1: 135:353783 

L4 ANSWER 5 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 355157-34-5 REGISTRY 

CN Protein ECTACC (human) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AF220152-derived protein GI 7934572 
CI MAN 
SQL 906 



SEQ 1 MFWYKRRGAP MRPVSVIDGV VCVSPGCGSE TVPVPDGPRS DSVEGSPFRP 

51 PSHSFSAVFD EDKPIASSGT YNLDFDNIEL VDTFQTLEPR ASDAKNQEGK 



101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
HITS AT: 



VNTRRKSTDS 

GSSSASSTLK 

GENLASETKT 

PASGGGRVQN 

EFDYSEDKSS 

ASPPRSPAEP 

SYNFDPDTCD 

NKPAKKKKTP 

VHATDEEKLA 

ELDYRNSYEI 

MSESPTPCSG 

KELEAMGLGT 

QQPDLDSALQ 

IAQMIEDEQR 

KEVLEGFRKN 

AQVRGKAQQE 

AKMGKS 

87-92 



VPISKSTLSR 
RTKKPRPPSL 
ESAKTEGPSP 
SPPVGRKTLP 
WDNQQENPPP 
NDIPIAKGTY 
ESVDPFKTSS 
LKTDTFRVKK 
VTNQKWTCMT 
EYMEKIGSSL 
SSFEETEALV 
PSEAIEIREA 
IARAEIITKE 
EKSVSHQTVQ 
EEVLKRCAQE 
QAAHQASLRK 



SLSLQASDFD 
KKKQTTKKPT 
ALLEETPLEP 
LTTAPEAGEV 
TKKIGKKPVA 
TFDIDKWDDP 
KTPSSPSKSP 
SPKRSPLSDP 
VDLEADKQDY 
PQDDDAPKKQ 
NTAAKNQHPV 
AHPTDVSISK 
REVSEWKDKY 
QLVLEKEQAL 
YLSRVKKEEQ 
EQLRVDALER 



GASSSGNPEA 
ETPPVKETQQ 
AVGPKAACPL 
TPSDSGGQED 
KMPLRRPKMK 
NFNPFSSTSK 
AS FE I PAS AM 
PSQDPTPAAT 
PQPSDLSTFV 
ALYLMFDTSQ 
PRGLAPNQES 
TALYSRIGTA 
EESRREVMEM 
ADLNSVEKSL 
RYQALKVHAE 
TLEQKNKEIE 



VALAPDAYST 
EPDEESLVPS 
DSESAEGVVP 
SPAKGLSVRL 
KTPEKLDNTP 
MQESPKLPQQ 
EANGVDGDGL 
PETPPVISAV 
NETKFSSPTE 
ESPVKSSPVR 
HLQVPEKSSQ 
EVEKPAGLLF 
RKIVAEYEKT 
ADLFRRYEKM 
EKLDRANAEI 
ELTKICDELI 



REFERENCE 1: 135:17 6236 

L4 ANSWER 6 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 352374-92-6 REGISTRY 

CN Protein (human clone 784CIF2_39 precursor) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 688: PN: WO0153312 SEQID: 1825 claimed protein 
CI MAN 
SQL 1025 



SEQ 1 MAYASWQRWS PVEWARWMWT AVTSSGDSSL LVLQGDSGKR SSDSEEAFET' 

51 PESTTPVKAP PAPPPPPPEV IPEPEVSTQP PPEEPGCGSE TVPVPDGPRS 

101 DSVEGSPFRP PSHSFSAVFD EDKPIASSGT YNLDFDNIEL VDTFQTLEPR 

151 ASDAKNQEGK VNTRRKSTDS VPISKSTLSR SLSLQASDFD GASSSGNPEA 

201 VALAPDAYST GSSSASSTLK RTKKPRPPSL KKKQTTKKPT ETPPVKETQQ 

251 EPDEESLVPS GENLASETKT ESAKTEGPSP ALLEETPLEP AVGPKAACPL 

301 DSESAEGVVP PASGGGRVQN SPPVGRKTLP LTTAPEAGEV TPSDSGGQED 

351 SPAKGLSVRL EFDYSEDKSS WDNQQENPPP TKKIGKKPVA KMPLRRPKMK 
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401 KTPEKLDNTP ASPPRSPAEP NDIPIAKGTY TFDIDKWDDP NFNPFSSTSK 
4 51 MQESPKLPQQ SYNFDPDTCD ESVDPFKTSS KTPSSPSKSP AS FE I PAS AM 
501 EANGVDGDGL NKPAKKKKTP LKTMVEDVMS VCSLFDTFRV KKSPKRSPLS 
551 DPPSQDPTPA ATPETPPVIS AVVHATDEEK LAVTNQKWTC MTVDLEADKQ 
601 DYPQPSDLST FVNETKFSSP TEELDYRNSY EIEYMEKIGS SLPQDDDAPK 
651 KQALYLMFDT SQESPVKSSP VRMSESPTPC SGSSFEETEA LVNTAAKNQH 
701- PVPRGLAPNQ ESHLQVPEKS SQKELEAMGL GTPSEAIEIT APEGSFASAD 
751 ALLSRLAHPV SLCGALDYLE PDLAEKNPPL FAQKLQREAA HPTDVSISKT 
801 ALYSRIGTAE VEKPAGLLFQ QPDLDSALQI ARAEIITKER EVSEWKDKYE 
851 ESRREVMEMR KIVAEYEKTI AQMIEDEQRE KSVSHQTVQQ LVLEKEQALA 
901 DLNSVEKSLA DLFRRYEBCMK EVLEGFRKNE EVLKRCAQEY LSRVKKEEQR 
951 YQALKVHAEE KLDRANAEIA QVRGKAQQEQ AAHQASLRKE QLRVDALERT 
1001 LEQKNKEIEE LTKICDELIA KMGKS 
HITS AT: . 147-152 

REFERENCE 1: 135:148240 

L4 ANSWER 7 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 352374-91-5 REGISTRY 

CN Protein (human clone 784CIF2_38 precursor) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 687: PN: WO0153312 SEQID:1824 claimed protein 
CI MAN 
SQL 966 

SEQ 1 MAYAS WQRWS PVEWARWMWT AVTSSGDSSL LVLQGDSGKR SSDSEEAFET 

51 PESTTPVKAP PAPPPPPPEV IPEPEVSTQP PPEEPGCGSE TVPVPDGPRS 
101 DSVEGSPFRP PSHSFSAVFD EDKPIASSGT YNLDFDNIEL VDTFQTLEPR 

151 ASDAKNQEGK VNTRRKSTDS VPISKSTLSR SLSLQASDFD GASSSGNPEA 

201 VALAPDAYST GSSSASSTLK RTKKPRPPSL KKKQTTKKPT ETPPVKETQQ 
251 EPDEESLVPS GENLASETKT ESAKTEGPSP ALLEETPLEP AVGPKAACPL 
301 DSESAEGVVP PASGGGRVQN SPPVGRKTLP LTTAPEAGEV TPSDSGGQED 
351 SPAKGLSVRL EFDYSEDKSS WDNQQENPPP TKKIGKKPVA KMPLRRPKMK 
401 KTPEKLDNTP ASPPRSPAEP NDIPIAKGTY TFDIDKWDDP NFNPFSSTSK 
451 MQESPKLPQQ SYNFDPDTCD ESVDPFKTSS KTPSSPSKSP AS FE I PAS AM 
501 EANGVDGDGL NKPAKKKKTP LKTDTFRVKK SPKRSPLSDP PSQDPTPAAT 
551 PETPPVISAV VHATDEEKLA VTNQKWTCMT VDLEADKQDY PQPSDLSTFV 
601 NETKFSSPTE ELDYRNSYEI EYMEKIGSSL PQDDDAPKKQ ALYLMFDTSQ 
651 ESPVKSSPVR MSESPTPCSG SSFEETEALV NTAAKNQHPV PRGLAPNQES 
701 HLQVPEKSSQ KELEAMGLGT PSEAIEIREA AHPTDVSISK TALYSRIGTA 
751 EVEKPAGLLF QQPDLDSALQ IARAEIITKE REVSEWKDKY EESRREVMEM 
801 RKIVAEYEKT IAQMIEDEQR EKSVSHQTVQ QLVLEKEQAL ADLNSVEKSL. 
851 ADLFRRYEKM KEVLEGFRKN EEVLKRCAQE YLSRVKKEEQ RYQALKVHAE 
901 EKLDRANAEI AQVRGKAQQE QAAHQASLRK EQLRVDALER TLEQKNKEIE 
951 ELTKICDELI AKMGKS 
HITS AT: 147-152 

REFERENCE 1: 135:148240 

L4 ANSWER 8 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 352374-90-4 REGISTRY 

CN Protein (human clone 784CIF2__37 precursor) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 686: PN: WO0153312 SEQID: 1823 claimed protein 
CI MAN 
SQL 1013 . 



Searcher : Shears 308-4994 



09/627383 



SEQ 1 MAYASWQRWS PVEWARWMWT AVTSSGDSSL LVLQGDSGKR SSDSEEAFET 

51 PESTTPVKAP PAPPPPPPEV IPEPEVSTQP PPEEPGCGSE TVPVPDGPRS 
101 DSVEGSPFRP PSHSFSAVFD EDKPIASSGT YNLDFDNIEL VDTFQTLEPR 



151 ASDAKNQEGK VNTRRKSTDS VPISKSTLSR SLSLQASDFD GASSSGNPEA 



201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
HITS AT: 



VALAPDAYST 
EPDEESLVPS 
DSESAEGVVP 
SPAKGLSVRL 
KTPEKLDNTP 
MQESPKLPQQ 
EANGVDGDGL 
PETPPVISAV 
NETKFSSPTE 
ESPVKSSPVR 
HLQVPEKSSQ 
CGALDYLEPD 
KPAGLLFQQP 
VAEYEKTIAQ 
FRRYEKMKEV 
DRANAEIAQV 
KICDELIAKM 
147-152 



GSSSASSTLK 
GENLASETKT 
PASGGGRVQN 
EFDYSEDKSS 
ASPPRSPAEP 
SYNFDPDTCD 
NKPAKKKKTP 
VHATDEEKLA 
ELDYRNSYEI 
MSESPTPCSG 
KELEAMGLGT 
LAEKNPPLFA 
DLDSALQIAR 
MIEDEQREKS 
LEGFRKNEEV 
RGKAQQEQAA 
GKS 



RTKKPRPPSL 
ESAKTEGPSP 
SPPVGRKTLP 
WDNQQENPPP 
NDIPIAKGTY 
ESVDPFKTSS 
LKTDTFRVKK 
VTNQKWTCMT 
EYMEKIGSSL 
SSFEETEALV 
PSEAIEITAP 
QKLQREAAHP 
AEIITKEREV 
VSHQTVQQLV 
LKRCAQEYLS 
HQASLRKEQL 



KKKQTTKKPT 
ALLEETPLEP 
LTTAPEAGEV 
TKKIGKKPVA 
TFDIDKWDDP 
KTPSSPSKSP 
SPKRSPLSDP 
VDLEADKQDY 
PQDDDAPKKQ 
NTAAKNQHPV 
EGSFASADAL 
TDVSISKTAL 
SEWKDKYEES 
LEKEQALADL 
RVKKEEQRYQ 
RVDALERTLE 



ETPPVKETQQ 
AVGPKAACPL 
TPSDSGGQED 
KMPLRRPKMK 
NFNPFSSTSK 
AS FE I PAS AM 
PSQDPTPAAT 
PQPSDLSTFV 
ALYLMFDTSQ 
PRGLAPNQES 
LSRLAHPVSL 
YSRIGTAEVE 
RREVMEMRKI 
NSVEKSLADL 
ALKVHAEEKL 
QKNKEIEELT 



REFERENCE 



1: 135:148240 



L4 ANSWER 9 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 342874-40-2 REGISTRY 

CN Calcium channel (Rattus norvegicus brain subunit 

(CA INDEX NAME) 
OTHER NAMES: 



.alpha. 1H) (9CI) 



CN 
CN 
CI 
SQL 

SEQ 



Calcium channel (rat brain subunit .alpha. 1H) 
GenBank AF290213-derived protein GI 11415018 
MAN 
2359 



1 MTEGTLAADE 

51 PESPGTECGA 

101 WFEHVSMLVI 

151 MVIKMVALGL 

201 VRVLRPLRAI 

251 LWAGLLRNRC 

301 KCSHIPSRRE 

351 EFNPHNGAIN 

4 01 LLIIMGSFFM 

451 SEPGSCYEEL 

501 PGRRPRRAGR 

551 LVRACAPPSP 

601 SLKLASGLGT 

651 LGSPRPYEKI 

701 PYCASALEDP 

751 TPGHSNERRR 

801 LVNTLSMGVE 

851 RNPYNIFDGI 

901 QLVVLMRTMD 

951 RKNFDSLLWA 



VRVPLGASPP 
DLGADEEQPV 
MLNCVTLGMF 
FGQKCYLGDT 
NRVPSMRILV 
FLDSAFVRNN 
LRVQCTLGWE 
FDNIGYAWIA 
INLCLVVIAT 
LKYVGHIFRK 
RTASVHHLVY 
PSPGHGPPDS 
MNYPTILPSG 
QDWGEQGLG 
EFEFSGSESG 
TPLRKASQPG 
YHEQPEELTN 
VVVISVWEIV 
NVAT FCMLLM 
IVTVFQILTQ 



APAAPVRASP 
PYPALAATVF 
RPCEDVECRS 
WNRLDFFIVM 
TLLLDTLPML 
NLTFLRPYYQ 
AYGQPQAEDG 
IFQVITLEGW 
QFSETKQREN 
VKRRSLRLYA 
HHHHHHHHHY 
ESVHSIYHAD 
TVNSKGGTSS 
RASSHLSGLS 
DSDAHGVYEF 
GIGHLWASFS 
ALEISNIVFT 
GQADGGQSVL 
LFIFIFSILG 
EDWNVVLYNG 



ASPGAPGREE 
FCLGQTTRPR 
ERCSILEAFD 
AGMMEYSLDG 
GNVLLLCFFV 
TEEGEENPFI 
GAGRNACINW 
VDIMYYVMDA 
QLMREQRARY 
RWQSRWRKKV 
HFSHGGPRRP 
CHVEGPQERA 
RPKGLRGAGA 
VPCPLPSPQA 
TQDVRHGDCR 
GKLRRIVDSK 
SMFALEMLLK 
RTFRLLRVLK 
MHLFGCKFSL 
MASTSSWAAL 



QGGSGSGVLA 
SWCLRLVCNP 
DFIFAFFAVE 
HNVSLSAIRT 
FFIFGIVGVQ 
CSSRRDNGMQ 
NQYYNVCRSG 
HSFYNFIYFI 
LSNDSTLASF 
DPSSTVHGQG 
SPEPGAGDNR 
RVAHSIATAA 
PGAAVHSPLS 
GTLTCELKSC 
DPVQQPHEVG 
YFNRGIMAAI 
LLACGPLGYI 
LVRFLPALRR 
KTDSGDTVPD 
YFVALMTFGN 



Searcher : Shears 308-4994 



09/627383 



1001 YVLFNLLVAI LVEGFQAEGD 

1051 KMYSLAVTPN GHLEGRGSLP 

1101 RRSSSGSVDP QLGDQKSLAS 

1151 LKRRNQCGER ESLLSGEGKG 

1201 AESLDHRSTL DLCPPRPAAL 

1251 AEFDDDIEDS CCFRLHKVLE 

1301 VIAHKMFDHV VLVFIFLNCI 

1351 VVEMMVKVVA LGLLWGEHAY 

14 01 KILGVLRVVR LLRTLRPLRV 

1451 FFIIFGILGV QLFKGKFYYC 

1501 LGQALMSLFV LSSKDGWVNI 

1551 LIVSFFVLNM FVGVVVENFH 

1601 RRPYYADYSH TRRSIHSLCT 

1651 DEALKYCNYV FTIVFVFEAA 

1701 IALEEIEMNA ALPINPTIIR 

1751 LPQVGNLGLL FMLLFFIYAA 

1801 MAFLTLFRVS TGDNWNGIMK 

1851 VAQFVLVNVV VAVLMKHLEE 

1901 AQESQGTQPD TPNLLVVRKV 

1951 EVEMETYTGP VTSAHSPPLE 

2001 RSLSLSRILC RQEAMHSESL 

2051 APRSPPCSPR PASVRTRKHT 

2101 ITSSAHPWPA TEPHSPEASP 

2151 GRPDAQRWSS VELDNGESHL 

2201 IEPPTKDEGS SRPPAAEGGN 

2251 PVAKGERWGQ ASCRAEHLTV 

2301 VSSLGAIVPL ILETELSMPS 

2351 PDDSGDEPV 

HITS AT: 1969-1974 



ATRSDTDEDK TSTQLEGDFD KLRDLRATEM 
PPLITHTAAT PMPTPKSSPN LDVAHALLDS 
LRSSPCTPWG PNSAGSSRRS SWNSLGRAPS 
STDDEAEDSR PSTGTHPGAS PGPRATPLRR 
LPTKFHDCNG QMVALPSEFF LRIDSHKEDA 
PYAPQWCRSR ESWALYLFPP QNRLRVSCQK 
TIALERPDID PGSTERAFLS VSNYIFTAIF 
LQSSWNVLDG LLVLVSLVDI IVAMASAGGA 
ISRAPGLKLV VETLISSLRP IGNIVLICCA 
EGTDTRNITT KAECHAAHYR WVRRKYNFDN 
MYDGLDAVGI DQQPVQNHNP WMLLYFISFL 
KCRQHQEAEE ARRREEKRLR RLERRRRKAQ 
SHYLDLFITF IICLNVITMS MEHYNQPKSL 
LKLVAFGFRR FFKDRWNQLD LAIVLLSIMG 
IMRVLRIARV LKLLKMATGM RALLDTVVQA 
LGVELFGRLE CSEDNPCEGL SRHATFTNFG 
DTLRECTRED KHCLSYLPAL SPVYFVTFML 
SNKEAREDAE MDAEIELEMA QGSTAQPPPT 
SVSRMLSLPN DSYMFRPVAP AAAPHSHPLQ 
PRASFQVPSA ASSPARVSDP LCALSPRGTP 

EGKVDDVGGD SIPDYTEPAE NMSTSQASTG 
FGQRCISSRP PTLGGDEAEA ADPADEEVSH 
TASPVKGTMG SGRDPRRFCS VDAQSFLDKP 
ESGEVRGRAS ELEPALGSRR KKKMSPPCIS 
TTLRRRTPSC EAALHRDCPE PTEGPGTGGD 
PNFAFEPLDM GGPGGDCFLD SDQSVTPEPR 
PDCPEKEQGL YLTVPQTPLK KPGSTPAT PA 



REFERENCE 1: 135:15597 



L4 ANSWER 10 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 332001-60-2 REGISTRY 

CN Bone morphogenetic protein receptors,, protein BRK-3 (mouse clone 

pJT6-mBRK-3L) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 7: PN: US6210899 SEQID: 8 claimed protein 
CI - MAN 
SQL 2887 



SEQ 1 METTHRSERS ERLEHISARG PRPHEARGVA LPRTRPLELE TRPALAVALL 

51 ELEVALSERT HRTHRALAAL ASERGLNASN GLNGLARGLE CYSALAPHEL 

101 YSASPPRTYR GLNGLNASPL EGLYILEGLY GLSERARGIL ESERHISGLA 

151 SNGLYTHRIL ELECYSSERL YSGLYSERTH -RCYSTYRGLY LETRPGLLYS 

201 SERLYSGLYA SPILEASNLE VALLYSGLNG LYCYSTRPSE RHISILEGLY 

251 ASPPRGLNGL CYSHISTYRG LGLCYSVALV ALTHRTHRTH RPRPRSERIL 

301 EGLNASNGLY THRTYRARGP HECYSCYSCY SSERTHRASP LECYSASNVA 

351 LASNPHETHR GLASNPHEPR PRPRASPTHR THRPRLESER PRPRHISSER 

4 01 PHEASNARGA SPGLTHRILE ILEILEALAL EALASERVAL SERVALLEAL 

451 AVALLEILEV ALALALECYS PHEGLYTYRA RGMETLETHR GLYASPARGL 

501 YSGLNGLYLE HISSERMETA SNMETMETGL ALAALAALAA LAGLPRSERL 

551 EASPLEASPA SNLELYSLEL EGLLEILEGL YARGGLYARG TYRGLYALAV 

601 ALTYRLYSGL YSERLEASPG LARGPRVALA LAVALLYSVA LPHESERPHE 

651 ALAASNARGG LNASNPHEIL EASNGLLYSA SNILETYRAR GVALPRLEME 

701 TGLHISASPA SNILEALAAR GPHEILEVAL GLYASPGLAR GLETHRALAA 

751 SPGLYARGME TGLTYRLELE VALMETGLTY RTYRPRASNG LYSERLECYS 



Searcher : Shears 308-4994 



09/627383 



801 LYSTYRLESE RLEHISTHRS 
851 SSERVALTHR ARGGLYLEAL 
901 TYRLYSPRAL AILESERHIS 
951 SASNASPGLY ALACYSVALI 
1001 GLYASNARGL EVALARGPRG 
1051 LYTHRILEAR GTYRMETALA 
1101 PCYSGLSERA LALELYSGLN 
1151 PGLVALPHEM ETARGCYSTH 
1201 GLNMETALAP HEGLNTHRGL 
1251 NVALLEVALS ERARGGLLYS 
1301 SNSERLEALA VALARGSERL 
1351 SPALAGLALA ARGLETHRAL 
1401 METILETRPG LARGASNLYS 
1451 RALAMETGLN ASNGLARGAS 
1501 LYPRTYRPRA SPTYRSERSE 
1551 THRASPSERI LEVALLYSAS 
1601 PRLETHRILE GLYGLLYSAS 
1651 ALAGLNALAA RGILEPRSER 
17 01 THRTHRTHRT HRASNTHRTH 
1751 LESERGLMET PRTYRPRASP 
1801 NSERILEGLY PRTHRPRVAL 
1851 LYSLEASPPR LYSGLVALAS 
1901' METGLHISSE RLELYSGLNP 
1951 ERSERLELET YRPRLEILEL 
2001 PHETHRGLNA LAALAASNGL 

2051 GLNILETYRP RLEPRLYSGL 

2101 EASNTHRLYS ASNSERTHRL 

2151 YSSERASNLE LYSGLNVALG 

2201 NALAALAGLP RHISVALVAL 

2251 SERHISASNV ALASNSERHI 

2301 AVALPRALAG LYGLNALAAL 

2351 LEGLNASNGL NPHEILEGLY 

24 01 ASPGLHISGL PRLELEARGA 

24 51 LEASPARGLE VALASPARGA 

. 2501 RASNASNASN ASNSERASNP 

2551 ALTHRSERTH RALAALAASP 

2601 PRASNSERLE ASPLESERAL 

2651 ILEGLYGLSE RTHRGLNASP 

2701 RGARGVALLY STHRPRTYRS 

2751 ILESERTHRG LPRLEASPCY 

2801 VALHISSERL YSSERSERTH 

2851 HRTHRTHRVA LSERLYSASP 

HITS AT: 2034-2039 



ERASPTRPVA LSERSERCYS ARGLEALAHI 
ATYRLEHIST HRGLLEPRAR GGLYASPHIS 
ARGASPLEAS NSERARGASN VALLEVALLY 
LESERASPPH EGLYLESERM ETARGLETHR 
LYGLGLASPA SNALAALAIL ESERGLVALG 
PRGLVALLEG LGLYALAVAL ASNLEARGAS 
VALASPMETT YRALALEGLY LEILETYRTR 
RASPLEPHEP RGLYGLSERV ALPRASPTYR 
VALGLYASNH ISPRTHRPHE GLAS PMETGL 
GLNARGPRLY SPHEPRGLAL ATRPLYSGLA 
ELYSGLTHRI LEGLASPCYS TRPASPGLNA 
AGLNCYSALA GLGLARGMET ALAGLLEMET 
SERVALSERP RTHRVALASN PRMETSERTH 
NLESERHISA SNARGARGVA LPRLYSILEG 
RSERSERTYR ILEGLASPSE RILEHISHIS 
NILESERSER GLHISSERME TSERSERTHR 
NARGASNSER ILEASNTYRG LARGGLNGLN 
PRGLTHRSER VALTHRSERL ESERTHRASN 
RGLYLETHRP RSERTHRGLY METTHRTHRI 
GLTHRHISLE HISALATHRA SNVALALAGL 
CYSLEGLNLE THRGLGLASP LEGLTHRASN 
PLYSASNLEL YSGLSERSER ASPGLASNLE 
HESERGLYPR ASPPRLESER SERTHRSERS 
YS LEAL AVAL GLVALTHRGL YGLNGLNASP 
YGLNALACYS LEI LEPRAS P VALPRPRALA 

NGLNASNLEP RLYSARGPRT HRSERLEPRL 
YSGLPRARGL ELYSPHEGLY ASNLYSHISL 
LTHRGLYVAL ALALYSMETA SNTHRILEAS 
THRVALTHRM ETASNGLYVA LALAGLYARG 
SALAALATHR THRGLNTYRA LAASNGLYAL 
AASNILEVAL ALAHISARGS ERGLNGLMET 
GLASPTHRAR GLEASNILEA SNSERSERPR 
RGGLGLNGLN ALAGLYHISA SPGLGLYVAL 
RGGLARGPRL EGLGLYGLYA RGTHRASNSE 
RCYSSERGLG LNASPILELE THRGLNGLYV 
PRGLYPRSER LYSPRARGAR GALAGLNARG 
ATHRASNILE LEASPGLYSE RSERILEGLN 
GLYLYSSERG LYSERGLYGL LYSILELYSA 
ERLELYSARG TRPARGPRSE RTHRTRPVAL 
SGLVALASNA SNASNGLYSE RASPARGALA 
RALAVALTYR LEALAGLGLY GLYTHRALAT 
ILEGLYMETA SNCYSLE 



REFERENCE 1: 134:261275 



L4 ANSWER 11 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 332001-59-9 REGISTRY 

CN Bone morphogenetic protein receptors, protein BRK-3 (human foreskin 

fibroblast clone pHSK1040) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2: PN: US6210899 SEQID: 2 claimed protein 
CI MAN 
SQL 2887 

SEQ 1 METTHRSERS ERLEGLNARG PRTRPARGVA LPRTRPLEPR TRPTHRILEL 

51 ELEVALSERT HRALAALAAL ASERGLNASN GLNGLARGLE CYSALAPHEL 



Searcher 



Shears 308-4994 



09/627383 



101 YSASPPRTYR GLNGLNASPL 

151 SNGLYTHRIL ELECYSSERL 

201 SERLYSGLYA SPILEASNLE 

251 ASPPRGLNGL CYSHISTYRG 

301 EGLNASNGLY THRTYRARGP 

351 LASNPHETHR GLASNPHEPR 

401 PHEASNARGA SPGLTHRILE 

451 AVALLEILEV ALALALECYS 

501 YSGLNGLYLE HISSERMETA 

551 EASPLEASPA SNLELYSLEL 
601. ALTYRLYSGL YSERLEASPG 

651 ALAASNARGG LNASNPHEIL 

701 TGLHISASPA SNILEALAAR 

751 ASPGLYARGM ETGLTYRLEL 

801 SLYSTYRLES ERLEHISTHR 

851 ISSERVALTH RARGGLYLEA 

901 STYRLYSPRA LAILESERHI 

951 YSASNASPGL YTHRCYSVAL 

1001 RGLYASNARG LEVALARGPR 

1051 GLYTHRILEA RGTYRMETAL 

1101 SPCYSGLSER ALALELYSGL 

1151 RPGLILEPHE METARGCYST 

1201 GLNMETALAP HEGLNTHRGL 

1251 NVALLEVALS ERARGGLLYS 

1301 SNSERLEALA VALARGSERL 

1351 SPALAGLALA ARGLETHRAL 

1401 METILETRPG LARGASNLYS 

1451 RALAMETGLN ASNGLARGAS 

1501 LYPRTYRPRA SPTYRSERSE 

1551 THRASPSERI LEVALLYSAS 

1601 PRLETHRILE GLYGLLYSAS 

1651 ALAGLNALAA RGILEPRSER 

17 01 THRTHRTHRT HRASNTHRTH 

1751 LESERGLMET PRTYRPRASP 

1801 NSERILEGLY PRTHRPRVAL 

1851 LYSLEASPPR LYSGLVALAS 

1901 METGLHISSE RLELYSGLNP 

1951 ERSERLELET YRPRLEILEL 

2001 PHETHRGLNT HRALAASNGL 

2051 GLNILETYRP RLEPRLYSGL 

2101 EASNTHRLYS ASNSERTHRL 

2151 YSSERASNLE LYSGLNVALG 

2201 NALAALAGLP RH IS VALVAL 

2251 ASNHISSERV ALASNSERHI 

2301 RVALLESERG LYGLNTHRTH 

2351 LEGLNASNGL NPHEILEGLY 

24 01 ASPGLHISGL PRLELEARGA 

2451 LEASPARGLE VALAS P ARG A 

2501 RASNASNASN ASNSERASNP 

2551 ALPRSERTHR ALAALAASPP 

2601 RASNSERLEA SPLESERALA 

2651 LEGLYGLSER THRGLNASPG 

2701 SARGVALLYS THRPRTYRSE 

2751 LESERTHRGL SERLEASPCY 

2801 VALHISSERL YSSERSERTH 

2851 HRTHRMETVA LSERLYSASP 

HITS AT: 2034-2039 



EGLYILEGLY GLSERARGIL ESERHISGLA 
YSGLYSERTH RCYSTYRGLY LETRPGLLYS 
VALLYSGLNG LYCYSTRPSE RHISILEGLY 
LGLCYSVALV ALTHRTHRTH RPRPRSERIL 
HECYSCYSCY SSERTHRASP LECYSASNVA 
PRPRASPTHR THRPRLESER PRPRHISSER 
ILEILEALAL EALASERVAL SERVALLEAL 
PHEGLYTYRA RGMETLETHR GLYASPARGL 
SNMETMETGL ALAALAALAS ERGLPRSERL 
EGLLEILEGL YARGGLYARG TYRGLYALAV 
LARGPRVALA LAVALLYSVA LPHESERPHE 
EASNGLLYSA SNILETYRAR GVALPRLEME 
GPHEILEVAL GLYASPGLAR GVALTHRALA 
EVALMETGLT YRTYRPRASN . GLYSERLECY 
SERASPTRPV ALSERSERCY SARGLEALAH 
LATYRLEHIS THRGLLEPRA RGGLYASPHI 
SARGASPLEA SNSERARGAS NVALLEVALL 
ILESERASPP HEGLYLESER METARGLETH 
GLYGLGLASP ASNALAALAI LESERGLVAL 
APRGLVALLE GLGLYALAVA LASNLEARGA 
N VALAS PMET TYRALALEGL YLEILETYRT 
HRASPLEPHE PRGLYGLSER VALPRGLTYR 
VALGLYASNH ISPRTHRPHE GLASPMETGL 
GLNARGPRLY SPHEPRGLAL ATRPLYSGLA 
ELYSGLTHRI LEGLASPCYS TRPASPGLNA 
AGLNCYSALA GLGLARGMET ALAGLLEMET 
SERVALSERP RTHRVALASN PRMETSERTH 
NLESERHISA SNARGARGVA LPRLYSILEG 
RSERSERTYR ILEGLASPSE RILEHISHIS 
NILESERSER GLHISSERME TSERSERTHR 
NARGASNSER ILEASNTYRG LARGGLNGLN 
PRGLTHRSER VALTHRSERL ESERTHRASN 
RGLYLETHRP RSERTHRGLY METTHRTHRI 
GLTHRASNLE HISTHRTHRA SNVALALAGL 
CYSLEGLNLE THRGLGLASP LEGLTHRASN 
PLYSASNLEL YSGLSERSER ASPGLASNLE 
HESERGLYPR ASPPRLESER SERTHRSERS 
YSLEALAVAL GLALATHRGL YGLNGLNASP 
YGLNALACYS LEILEPRASP VALLEPRTHR 



NGLNASNLEP RLYSARGPRT HRSERLEPRL 
YSGLPRARGL ELYSPHEGLY SERLYSHISL 
LTHRGLYVAL ALALYSMETA SNTHRILEAS 
THRVALTriRM ETASNGLYVA LALAGLYARG 
SALAALATHR THRGLNTYRA LAASNGLYTH 
RASNILEVAL THRHISARGA LAGLNGLMET 
GLASPTHRAR GLEASNI'LEA SNSERSERPR 
RGGLGLNGLN ALAGLYHISA SPGLGLYVAL 
RGGLARGPRL EGLGLYGLYA RGTHRASNSE 
RCYSSERGLG LNASPVALLE ALAGLNGLYV 
RGLYPRSERL YSPRARGARG ALAGLNARGP 
THRASNVALL EASPGLYSER SERILEGLNI 
LYLYSSERGL YSERGLYGLL YSILELYSLY 
RLELYSARGT RPARGPRSER THRTRPVALI 
SGL VALAS NA SNASNGLYSE RASNARGALA 
RALAVALTYR LEALAGLGLY GLYTHRALAT 
ILEGLYMETA SNCYSLE 



Searcher 



Shears 308-4994 



09/627383 



REFERENCE 1: 134:261275 

L4 ANSWER 12 OF 28 REGISTRY COPYRIGHT 2002 ACS 
.RN 331287-05-9 REGISTRY 

CN 1: PN: WO0119971 SEQID: 1 unclaimed protein (9CI) (CA INDEX NAME) 
CI MAN 
SQL 682 

SEQ 1 METSERLYSL ESERVALPHE PHEILEPHEL EPHECYSSER ILEALATHRA 

51 LAALAGLSER LEPRASPLEL YSILEGLLYS LEASPGLGLY VALTYRVALH 

101 ISTHRSERPH EGLGLVALAS NGLYTRPGLY VALVALPRLY SHISGLYLEV 

151 ALVALLEVAL ASNALAGLAL ATYRLEILEA SPTHRPRPHE THRALALYSA 

201 SPTHRGLLYS LEVALTHRTR PPHEVALGLA RGGLYTYRLY SILELYSGLY 

251 SERILESERS ERHISPHEHI SSERASPSER THRGLYGLYI LEGLTRPLEA 

301 SNSERARGSE RILEPRTHRT YRALASERGL LETHRASNGL LELELYSLYS 

351 ASPGLYLYSV ALGLNALATH RASNSERPHE SERGLYVALA SNTYRTRPLE 

4 01 VALLYSASNL YSILEGLVAL PHETYRPRGL YPRGLYHIST HRPRASPASN 

4 51 VALVALVALT RPLEPRGLAR GLYSILELEP HEGLYGLYCY SPHEILELYS 

501 PRTYRGLYLE GLYASNLEGL YASPALAASN ILEGLALATR PPRLYSSERA 

551 LALYSLELEL YSSERLYSTY RGLYLYSALA LYSLEVALVA LPRSERHISS 

601 ERGLVALGLY ASPALASERL ELELYSLETH RLEGLGLNAL AVALLYSGLY 

651 LEASNGLSER LYSLYSPRSE RLYSPRSERA SN 

HITS AT: 61-66 

REFERENCE 1: 134:248841 

L4 ANSWER 13 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 326928-35-2 REGISTRY 

CN Protein (human clone PLACE1004777 ) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 168: PN: EP1074617 SEQID: 12736 claimed protein 
CI MAN 
SQL 526 

SEQ 1 MDRASVCKLC KYACHKKCCL KTTAKCSKKY DPELSSRQFG VELSRLTSED 

51 RTVPLWEKL INYIEMHGLY TEGIYRKSGS TNKIKELRQG LDTDAESVNL 
101 DDYNIHVIAS VFKQWLRDLP NPLMTFELYE EFLRAMGLQE RKETIRGVYS 
151 VIDQLSRTHL NTLERLIFHL VRIALQEDTN RMSANALAIV FAPCILRCPD 
201 TTDPLQSVQD ISKTTTCVEL IVVEQMNKYK ARLKDISSLE FAENKAKTRL 
251 SLIRRSMGKG RIRRGNYPGP SSPVVVRLPS VSDVSEETLT SEAAMETDIT 
301 EQQQAAMQQE ERVLTEQIEN LQKEKEELTF EMLVLEPRAS DDETLESEAS 



351 IGTADSSENL NMESEYAISE KSERSLALSS LKTAGKSEPS SKLRKQLKKQ 

401 QDSLDVVDSS VSSLCLSNTA SSHGTRKLFQ IYSKSPFYRA ASGNEALGME 

451 GPLGQTKFLE DKPQFISRGT FNPEKGKQKL KNVKNSPQKT KETPEGTVMS 

501 GRRKTVDPDC TSNQQLALFG NNEFMV 

HITS AT: 335-340 

REFERENCE 1: 134:188985 

L4 ANSWER 14 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 324745-26-8 REGISTRY 

CN L-Serine, L-leucyl-L- . alpha . -glutamyl-L-prolyl-L-arginyl-L-alanyl- 

(9CI) (CA INDEX NAME) 
SQL 6 



Searcher : Shears 308-4994 



09/627383 

SEQ 1 LEPRAS 

HITS AT: 1-6 

REFERENCE 1: 134:159864 

L4 ANSWER 15 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 319502-44-8 REGISTRY 

CN 8: PN: WO0102561 SEQID: 26 unclaimed protein (9CI) (CA INDEX NAME) 
CI MAN 
SQL 2359 

SEQ 1 MTEGTLAADE VRVPLGASPP APAAPVRASP ASPGAPGREE QGGSGSGVLA 

51 PESPGTECGA DLGADEEQPV PYPALAATVF FCLGQTTRPR SWCLRLVCNP 

101 WFEHVSMLVI MLNCVTLGMF RPCEDVECRS ERCSILEAFD DFIFAFFAVE 

151 MVIKMVALGL FGQKCYLGDT WNRLDFFIVM AGMMEYSLDG HKVSLSAIRT 

201 VRVLRPLRAI NRVPSMRILV TLLLDTLPML GNVLLLCFFV FFIFGIVGVQ 

251 LWAGLLRNRC FLDSAFVRNN NLTFLRPYYQ TEEGEENPFI CSSRRDNGMQ 

301 KCSHIPSRRE LRVQCTLGWE AYGQPQAEDG GAGRNACINW NQYYNVCRSG 

351 EFNPHNGAIN FDNIGYAWIA IFQVITLEGW VDIMYYVMDA HSFYNFIYFI 

401 LLIIMGSFFM INLCLVVIAT QFSETKQREN QLMREQRARY LSNDSTLASF 

451 SEPGSCYEEL LKYVGHIFRK VKRRSLRLYA RWQSRWRKKV DPSSTVHGQG 

501 PGRRPRRAGR RTASVHHLVY HHHHHHHHHY HFSHGGPRRP SPEPGAGDNR 

551 LVRACAPPSP PSPGHGPPDS ESVHSIYHAD CHVEGPQERA RVAHSIATAA 

601 SLKLASGLGT MNYPTILPSG TVNSKGGTSS RPKGLRGAGA PGAAVHSPLS 

651 LGSPRPYEKI QDWGEQGLG RASSHLSGLS VPCPLPSPQA GTLTCELKSC 

701 PYCASALEDP EFEFSGSESG DSDAHGVYEF TQDVRHGDCR DPVQQPHEVG 

751 TPGHSNERRR TPLRKASQPG GIGHLWASFS GKLRRIVDSK YFNRGIMAAI 

801 LVNTLSMGVE YHEQPEELTN ALEISNIVFT SMFALEMLLK LLACGPLGYI 

851 RNPYNIFDGI VWISVWEIV GQADGGQSVL RTFRLLRVLK LVRFLPALRR 

901 QLVVLMRTMD NVATFCMLLM LFIFIFSILG MHLFGCKFSL KTDSGDTVPD 

951 RKNFDSLLWA IVTVFQILTQ EDWNWLYNG MASTSSWAAL YFVALMTFGN 

1001 YVLFNLLVAI LVEGFQAEGD ATRSDTDEDK TSTQLEGDFD KLRDLRATEM 

1051 KMYSLAVTPN GHLEGRGSLP PPLITHTAAT PMPTPKSSPN LDVAHALLDS 

1101 RRSSSGSVDP QLGDQKSLAS LRSSPCTPWG PNSAGSSRRS SWNSLGRAPS 

1151 LKRRSQCGER ESLLSGEGKG STDDEAEDSR PSTGTHPGAS PGPRATPLRR 

1201 AESLDHRSTL DLCPPRPAPP AVQVHDCNGQ MVALPSEFFL RIDSHKEDAA 

1251 EFDDDIEDSC CFRLHKVLEP YAPQWCRSRE SWALYLFPPQ NRLRVSCQKV 

1301 IAHKMFDHVV LVFIFLNCIT IALERPDIDP GSTERAFLSV SNYIFTAIFV 

1351 VEMMVKVVAL GLLWGEHAYL QSSWNVLDGL LVLVSLVDII VAMASAGGAK 

1401 ILGVLRVVRL LRTLRPLRVI SRAPGLKLW ETLISSLRPI GNIVLICCAF 

1451 FIIFGILGVQ LFKGKFYYCE GTDTRNITTK AECHAAHYRW VRRKYNFDNL 

1501 GQALMSLFVL SSKDGWVNIM YDGLDAVGID QQPVQNHNPW MLLYFISFLL 

1551 IVSFFVLNMF VGWVENFHK CRQHQEAEEA RRREEKRLRR LERRRRKAQR 

1601 RPYYADYSHT RRSIHSLCTS HYLDLFITFI ICLNVITMSM EHYNQPKSLD 

1651 EALKYCNYVF TIVFVFEAAL KLVAFGFRRF FKDRWNQLDL AIVLLSIMGI 

1701 ALEEIEMNAA LPINPTIIRI MRVLRIARVL KLLKMATGMR ALLDTWQAL 

1751 PQVGNLGLLF MLLFFIYAAL GVELFGRLEC SEDNPCEGLS RHATFTNFGM 

1801 AFLTLFRVST GDNWNGIMKD TLRECTREDK HCLSYLPALS PVYFVTFMLV 

1851 AQFVLVNVVV AVLMKHLEES NKEAREDAEM DAEIELEMAQ GSTAQPPPTA 

1901 QESQGTQPDT PNLLWRKVS VSRMLSLPND SYMFRPVAPA AAPHSHPLQE 

1951 VEMETYTGPV TSAHSPPLEP RAS FQVPSAA SSPARVSDPL CALSPRGTPR 

2001 SLSLSRILCR QEAMHSESLE GKVDDVGGDS IPDYTEPAEN MSTSQASTGA 

2051 PRSPPCSPRP ASVRTRKHTF GQRCISSRPP TLGGDEAEAA DPADEEVSHI 

2101 TSSAHPWPAT EPHSPEASPT AS PVKGTMGS GRDPRRFCSV DAQSFLDKPG 

2151 RPDAQRWSSV ELDNGESHLE SGEVRGRASE LEPALGSRRK KKMSPPCISI 

2201 EPPTKDEGSS RPPAAEGGNT TLRRRTPSCE AALHRDCPEP TEGPGTGGDP 



Searcher 



Shears 



308-4994 



09/627383 



2251 VAKGERWGQA SCRAEHLTVP NFAFEPLDMG GPGGDCFLDS DQSVTPEPRV 

2301 SSLGAIVPLI LETELSMPSP DCPEKEQGLY LTVPQTPLKK PGSTPATPAP 

2351 DDSGDEPVZ 

HITS AT: 1968-1973 

REFERENCE 1: 134:111253 

L4 ANSWER 16 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 316927-56-7 REGISTRY 

CN 10: PN: WO0100822 SEQID: 10 unclaimed protein (9CI) (CA INDEX NAME) 
CI MAN 
SQL 3096 

SEQ 1 METGLGLPHE ASPLEVALLY STHRLEHISL YSTHRSERSE RSERVALGLY 

51 SERASPGLAS NSERLEHISS ERLEGLYLEA SNLEASNTHR ASPARGSERS 

101 ERPRHISLES ERTHRASNGL YVALSERSER PHESERGLYL YSTHRARGPR 

151 SERVALILEG LNGLYTHRVA LGLVALLETH RSERLEMETG LNGLLEGLNA 

201 SNSERGLYLY STHRASPSER GLLETRPLYS ASNCYSGLTH RARGTRPLEG 

251 LNLEPHEASN LEVALGLLYS GLNCYSGLNG LGLNILEVAL ALAGLNGLNG 

301 LGLNPHEHIS ASNGLNILEG LNHISILEGL NGLGLILELY SASNLEVALL 

351 YSLEGLNTHR SERSERALAS ERLEALASER CYSGLGLYAS NSERSERASN 

401 LYSGLNVALS ERSERGLSER GLNMETGLYP HEPHESERLE SERSERGLAR 

451 GASNGLSERV ALILEHISTY RPRGLSERTH RGLPRGLILE GLNGLNGLME 

501 TSERTHRSER GLNPRASPCY SASNVALASP SERCYSSERV ALSERSERGL 

551 YTYRGLYTHR PHECYSILES ERGLLEASNL ETYRLYSSER LYSASPPRLY 

601 SGLPHEMETG LHISILEASP VALPRLYSGL YGLNTYRVAL ALAPRALAVA 

651 LPRALAGLSE RLEVALASPG LYVALLYSAS NGLASNPHET YRILEGLNTH 

701 RPRGLGLCYS HISVALSERL ELYSGLASPV ALSERILESE RPRGLYGLPH 

751 EGLHISASNP HELEGLYGLA SNLYSVALSE RGLVALTYRS ERGLYLYSTH 

801 RASNSERASN ALAILETHRS ERTRPALAGL NLYSLELYSG LNASNGLNPR 

851 LYSARGALAH ISVALGLASP GLYGLYSERA RGSERLYSGL NGLYASNGLG 

901 LNSERLYSLY STHRPRILEG LLYSSERASP PHEALAALAA LATHRHISPR 

951 ARGALAPHET YRLESERLYS PRASPGLTHR PRASNALATR PMETSERASP 

1001 SERGLYTHRG LYLETHRTYR TRPLYSLEGL GLLYSASPME THISHISSER 

1051 LEPRGLTHRL EGLLYSTHRP HEILESERLE SERSERTHRA SPVALSERPR 

1101 ASNGLNVALL ETHRLEASPP RTHRLEHISM ETLYSPRLYS GLNGLNILES 

1151 ERGLYILEGL NPRHISGLYL EPRASNALAL EASPASPARG ILESERPHES 



1201 ERPRASPSER VALLEGLPRS ERMETSERSE RPRSERASPI LEASPSERPH 

1251 ESERGLNALA SERASNVALT HRSERGLNLE PRGLYPHEPR LYSTYRPRSE 

1301 RHISTHRLYS ALASERPRVA LASPSERTRP LYSASNGLNT HRPHEGLNAS 

1351 NGLSERARGT HRSERSERTH RPHEPRSERV ALTYRTHRIL ETHRSERASN 

1401 ASPILESERV ALASNTHRVA LASPGLGLAS NTHRVALMET VALALASERA 

1451 LASERVALSE RGLNSERGLN LEPRGLYTHR ALAASNSERV ALPRGLCYSI 

1501 LESERLETHR SERLEGLASP PRVALILELE SERLYSILEA RGGLNASNLE 

1551 LYSGLLYSHI SALAARGHIS ILEALAASPL EARGALATYR TYRGLSERGL 

1601 ILEASNSERL ELYSGLNLYS LEGLALALYS GLILESERGL YVALGLASPT 

1651 RPLYSILETH RASNGLNILE LEVALASPAR GCYSGLYGLN LEASPSERAL 

1701 ALEHISGLAL ATHRSERARG VALARGTHRL EGLASNLYSA SNASNLELEG 

1751 LILEGLVALA SNASPLEARG GLARGPHESE RALAALASER SERALASERL 

1801 YSILELEGLN GLARGILEGL GLMETARGTH RSERSERLYS GLLYSASPAS 

1851 NTHRILEILE ARGLELYSSE RARGLEGI.NA SPLEGLGLAL APHEGLASNA 

1901 LATYRLYSLE SERASPASPL YSGLALAGLN LELYSGLNGL ASNLYSMETP 

1951 HEGLNASPLE LEGLYGLTYR GLSERLEGLY LYSGLHISAR GARGVALLYS 

2001 ASPALALEAS NTHRTHRGLA SNLYSLELEA SPALATYRTH RGLNILESER 

2051 ASPLELYSAR GMETILESER LYSLEGLALA GLNVALLYSG LNVALGLHIS 

2101 GLASNMETLE SERLEARGHI SASNSERARG ILEHISVALA RGPRSERARG 

2151 ALAASNTHRL EALATHRSER ASPVALSERA RGARGLYSTR PLEILEPRGL 



Searcher 



Shears 308-4994 



09/627383 



2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
HITS AT: 



YALAGLTYRS 
ALAS PASNGL 
SPSERSERPR 
HRSERASPTH 
ELYSASNTRP 
RGGLNTHRGL 
GLNARGGLNL 
SERASNARGL 
THRVALALAT 
ERHISLELES 
ASPLEASPTH 
LEGLNPHELE 
ASPASNHISV 
LYSVALSERV 
LNCYSLYSPR 
LYSILEARGT 
SASPGLNILE 
RLEGLNTHRA 
1170-1175 



ERILEPHETH 
NLEGLGLTHR 
GLYSERSERS 
RPRILEMETA 
GLYTHRGLNT 
THRSERVALA 
YSARGLEASN 
YSSERSERTH 
YRSERPRLYS 
ERLYSASNGL 
RVALPRVALS 
PRLEASPASP 
ALASNHISSE 
ALARGTHRAL 
VALSERVALT 
HRLEALAGLT 
GLALAALALE 
RGLEASNGLN 



RGLYGLNPRL 
CYSSERLEGL 
ERTHRSERLE 
RGALALELYS 
HRGLLYSGLA 
SNALASERAR 
SERALASERG 
RPRTHRLYSA 
ARGSERPRLY 
SERSERPRIL 
ERTHRLEGLN 
SERGLGLLYS 
RSERCYSPRG 
ATRPGLLYSA 
HRPRGLNGLY 
HRGLARGPHE 
SERARGMETP 
VALLYSCYSL 



EASPTHRGLN 
YHISARGSER 
LEILELYSLY 
GLLEASPGLG 
SPTHRSERAS 
GSERPRGLLY 
LNARGSERSE 
RGGLILEMET 
SGLASNLESE 
EARGPHEASP 
ARGTHRASNP 
THRTYRSERG 
LPRVALPRAS 
SNLYSSERVA 
ASNASPPHEG 
PHEASPGLLE 
RSERPRGLYG 
ESERLEASNL 



ASPSERASNV 
PRLEGLLYSA 
SGLNARGGLT 
LYLYSILEPH 
NILEASNPRA 
SCYSALAGLN 
RSERLEPRPR 
LETHRPRVAL 
RPRGLYPHES 
ILELELEASP 
RARGLYSGLN 
LLYSALATHR 
NGLYVALLYS 
LSERTYRGLG 
LTYRTHRALA 
THRLYSGLLY 
LYARGILETH 
ELEEND 



REFERENCE 



1: 134:82484 



(CA INDEX NAME) 



L4 ANSWER 17 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 284706-28-1 REGISTRY 

CN Protein (Xylella fastidiosa gene XF1737) (9CI) 
OTHER NAMES: 

CN GenBank AE003997-derived protein GI 9106807 
CI ' MAN 
SQL 242 

SEQ 1 MKPSRRSVLK SMGLLAAMPW LLPASRAFAA APMRIGVIGA GSLGGTVGRL 

51 WVKAGHEVMF SSRNPDKLEA MARELEPRAS VGQPLAATEF GTVLLLAVPF 



101 EALPQVGRDL RSAYRGKIVL DSTNPWGASS ADVYREAREL GVAQTWKYM 
151 PGARLVRAFS AV DAT WETS ASRRGGRIGM PLASDDAEAM KVAEGLVRDA 
201 GCDPVIVGNL AAAASFQPGG PGFRAHLTAP ELRRRLGLPA AS 
HITS AT: 75-80 



REFERENCE 



1: 133:130477 



L4 ANSWER 18 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 263525-23-1 REGISTRY 

CN Protein (Drosophila melanogaster gene CG12263) 

NAME) 
OTHER NAMES: 

CN GenBank AE003799-derived protein GI 7302608 
CI MAN 
SQL 1077 



(9CI) (CA INDEX 



SEQ 



1 
51 
101 
151 
201 
251 
301 
351 
401 



MNLTYLFVLI 
YVLTDEVVNE 
PLPYENKLVV 
IDIGSNFASP 
PSFDIFDLDS 
EMARKLGEMN 
LLFAYSKQHR 
NIVPDLRVPH 
LETEFILLTH 



WLAFLISSGV 
IFKDVNASSN 
LQELLQQNPD 
EINEDNIIDQ 
VDNEILKHLP 
EVIRSVVAAM 
FYGNDSGSDS 
LNKFQTLLLH 
RVQTVYNEVA 



MLFSRGFLLA 
LCLPQKSKVI 
HARLMRFRAD 
IVKNDLPWF 
KELESKDWQV 
DNDTTLLVMG 
EMLQQIDLVP 
SWQNAQQIYR 
FKSFVRDLNT 



RVSKTETSTC 
VLWDALKYE 
PPTTTLQRLK 
LGDSTWTDLY 
LVAHFLGVDH 
DHGMTASGDH 
TLATILGVPI 
YFFQYALENK 
NLRDILGTCR 



RRLSTNPNAE 
FGLYRANATD 
GLTTGSLPTF 
PHRFKRSYSY 
CGHKHGPMHE 
GGDTDDETNA 
PYSNLGLVNF 
RTFNVEQMDH 
EIWVRFDPTQ 
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451 MSQGLLFTFL 

501 YRYFKTFSFK 

551 HMPTRLLLFG 

601 DFRTKFKASQ 

651 NTGGAGFSLK 

701 LARYGPTVAS 

751 SWAPLMTFVL 

801 VVYGLATVYS 



PLFFIFLVVN 
TEEQGVIFFT 
SMAVFFSNSF 
FLRSTALRLI 
KPGTGKTYIL 
ICAGGHILLA 
PPRSSHTVTI 
SIVIAFGVFL 



NSRPADFPHI 
AISSAVILAF 
VIQEAKILSY 
LASVLAICLI 
AVVVLVVYTT 
NSSIKHIQRT 
NGNESIVPEI 
ALVMIVLLEP 



FKAKEVFYVY 
HTLRHWTSIA 
LLAAAILLLS 
RFAYTLFRCR 
LTRLYLRSCG 
HIDAMALVIY 
FRKMKRMYEG 
RASIGLWCV 



LINLAAGVFG 
TNWSAVKRFG 
HELLRLSARL 
EEQGNCSDFV 
NLTGNLPNVL 
GLLLVQIIVL 
DDDERRSHIP 
AVGAILLSVH 



851 SILRYRTATS FESCVQPTFT ALVGWFLLAH FCFFATSHQT TLSQIEWRAA 

901 FVGRTTGIGQ SNLVSGALVI LNTFCGPIFF FCMYSLLSTE TFSLFALFPN 

951 LIRSCRSGGK VDASTSMSDL ANEAVGFDMT RGELSLYEYE DVFLGTGFKL 

1001 ATQFFMLQGL KIFCAMLACT IHCRHLMVWK IFAPRFIYEA LATFVSLPAL 

1051 IVGYLLVLRI HRGVDTLIKR INKAKVH 

HITS AT; 828-833 

REFERENCE 1: 132:275067 



L4 ANSWER 19 OF 28 REGISTRY COPYRIGHT 2002 ACS 

RN 262988-39-6 REGISTRY 

CN Protein (Drosophila melanogaster gene Mer) (9CI) (CA INDEX NAME) 
OTHER NAMES : 

CN GenBank AE003512-derived protein GI 7293633 

CI MAN 

SQL 635 

SEQ 1 MSPFGSKKNR SLSVRVSTFD SELEFKLEPR ASGQDLFDLV CRTIGLRESW 



HITS AT 



51 YFGLQYVDTR 

101 EELIQEITQH 

151 TYKDGMLAGG 

201 YLKIAQDLDM 

251 QWNEIRHVSF 

301 YMRRRKPDTM 

351 EKSMEHLQNE 

4 01 TEMDRLRERQ 

4 51 ELICAKMAER 

501 STPSLITSSS 

551 TDNEMEQITN 

601 ILSEAQIKAG 
27-32 



SNVSWLKMEK 
LFFLQVKQSI 
ELLPKGVTDQ 
YGVNYFPITN 
DDKKFTIRLV 
EIQQMKAQAK 
MRMANDALRR 
MKIEREKHDL 
EATARLLEFL 
TNDLETAGGA 
EMERNHLDYL 
ENKYSTLKKL 



RVRDQRVELH 
LSMDIYCRPE 
YQMTPEMWEE 
KNKTKLWLGV 
DAKVSNFIFY 
EEKQRRQIER 
SEETKELYFE 
EKKIRDADFY 
NSGRKSSTDS 
ELTTHSSHYL 
RNSKQVQSQL 
KSGSTKARVA 



ASNNVYVFSF 
ASVLLASYAV 
RIKTWYMDHE 
TSVGLNIYDE 
SQDLHINKMI 
KKFIREKKLR 
KSRVNEEQMQ 
VHQLTVENDK 
LLTASSVSHA 
VQGDNSSGIS 
QTLRSEIAPH 
FFEEL 



YAKFFPENVS 
HVQYGPYDYE 
PMTRDEVEME 
RDKLTPKTTF 
LDLCKGNHDL 
EKAEHERYEL 
LTECKANHFK 
REAETEKLRK 
ANTASSMAAI 
DDFEPKEFIL 
KIEENQSNLD 



REFERENCE 



132:304167 



L4 ANSWER 20 OF 28 REGISTRY COPYRIGHT 2002 ACS 

RN 260386-80-9 REGISTRY 

CN Protein NMASP (Neisseria meningitidis) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN 2: PN: WO0012535 SEQID: 2 claimed protein 

CI MAN 

SQL 1253 



SEQ 



1 METLELEPRA SPPHEVALGL NLEVALGLNS ERGLGLYPRA LAVALVALAS 



51 NILEGLNALA ALAPRALAPR ARGTHRGLNA SNGLYSERSE 

101 HRASPSERAS PPRLEALAAS PSERASPPRP HETYRGLPHE 

151 EVALPRASNM ETPRGLILEP RGLNGLGLAL AASPASPGLY 

201 EGLYSERGLY PHEILEILES ERLYSASPGL YTYRILELET 



RASNALAGLT 
PHELYSARGL 
GLYLEASNPH 
HRASNTHRHI 
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251 SVALVALTHR 

301 YRTHRALALY 

351 I LEAS PALAT 

4 01 SPRGLYGLTR 

4 51 ALTHRALAGL 



501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
HITS AT: 



PRPHEILEGL 
PHEASNLELY 
SERGLYGLYP 
ASNVALALAG 
YVALILEILE 
YSALAGLYGL 
AGLYLEARGA 
RSERGLYASP 
RLEGLYVALT 
NALAALAGLH 
LGLNGLNSER 
HISTHRASPS 
ALAGLARGAL 
PRVALASNAS 
SNVALPRLEL 
GLN 

6-11, 483-4 



GLYMETGLYS 
SLEILEGLYS 
HRGLGLLEPR 
PVALALAALA 
YVALSERALA 

NTHRASPVAL 
SGLYGLNVAL 
HEMETGLYIL 
LGLNLELYSA 
GLNGLVALSE 
YALALEILEA 
LAGLYASPIL 
LEPRVALMET 
RPARGLYSGL 
ISILEGLYAL 
GLYTHRPHES 
ERSERGLYGL 
AGLYLEARGA 
PGLALAGLYP 
EILEMETARG 



ERILELYSVA 
ERASPVALGL 
VALVALLYS I 
ILEGLYALAP 
LYSGLYARGS 

ALAILEASNP 
VALGLYILEA 
ESERPHEALA 
SNTHRGLYLY 
RTYRGLYLEA 
LALYSILELE 
EVALLESERL 
VALGLYALAI 
YGLGLILETH 
ASERSERLYS 
ERVALGLSER 
YHISLEVALV 
RGGLYASPGL 
HEARGLYSAL 
ARGGLYASNT 



LLELEASNAS 
NSERAS PVAL 
LEGLYASNPR 
RPHEGLYPHE 
ERLEPRASNG 



RGLYASNSER 
SNSERGLNIL 
ILEPRILEAS 
SVALGLNARG 
LAGLNSERPH 
PRGLYSERPR 
EASPGLYGLY 
LETHRPRGLY 
RILELYSVAL 
THRASPGLAL 
ALAGLYILET 
ALVALARGVA 
ILELEALAVA 
AMETASPLYS 
HRLEPHEILE 



PLYSARGGLT 
ALALELELYS 
LYSASPLELY 
ASPASNSERV 
LSERTYRTHR 

GLYGLYPRLE 
ETYRSERARG 
PVALALAMET 
GLYGLNLEGL 
EGLYLEASPL 
ALAGLARGAL 
GLILEARGSE 
LYSGLVALSE 
LYSLEGLYAS 
APRTYRTHRG 
HRLEGLNTHR 
LSERAS PAL A 
LGLYGLNVAL 
ALAGLYLYSA 
ALALEASNLE 



88 



REFERENCE 1: 132:204089 

L4 ANSWER 21 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 253582-33-1 REGISTRY 

CN Tumor suppressor protein (human gene AZ-2) (9CI) 
OTHER NAMES: 

CN 4: PN: WO0000503 SEQID: 4 claimed protein 
CI MAN 
SQL 1219 



(CA INDEX NAME) 



SEQ 1 GTSDSPHGRR 

51 DTLGGERRPG 

101 EAEGDITLST 

151 KAPGMEGTAA 

201 HDPKLQHLAP 

251 EGTESSPVAD 

301 SEEAFETPES 

351 VPDGPRSDSV 

401 FQTLEPRASD 



451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



SSGNPEAVAL 
PVKETQQEPD 
PKAACPLDSE 
DSGGQEDSPA 
LRRPKMKKTP 
PFSSTSKMQE 
EIPASAMEAN 
AATPETPPVI 
TFVNETKFSS 
TSQESPVKSS 
QESHLQVPEK 
GTAEVEKPAG 
MEMRKIVAEY 
KSLADLFRRY 



WSWDFAHTGV 
VTAGILEMRN 
AETQACASGD 
LHGDSPARPQ 
EELHTDRESP 
DIIQPAAPAD 
TTPVKAPPAP 
EGSPFRPPSH 
AKNQEGKVNT 

APDAYSTGSS 
EESLVPSGEN 
SAEGWPPAS 
KGLSVRLEFD 
EKLDNTPASP 
SPKLPQQSYN 
GVDGDGLNKP 
SAWHATDEE 
PTEELDYRNS 
PVRMSESPTP 
SSQKELEAMG 
LLFQQPDLDS 
EKTIAQMIED 
EKMKEVLEGF 



PGHVPRSTCA 
ALGNQSTPAP 
LPEAGTTRTF 
QDKEQPGPER 
RPGPSMLPSV 
LESPTLAASS 
PPPPPEVIPE 
PFSAVFDEDQ 
RRKSTDSVPI 

SASSTLKRTK 
LASETKTESA 
GGGRVQNSPP 
YSEDKSSWDN 
PRSPAEPNDI 
FDPDTCDESV 
AKKKKTPLKT 
KLAVTNQKWT 
YEIEYMEKIG 
CSGSSFEETE 
LGTPSEAIEI 
ALQIARAEII 
EQREKSVSHQ 
RKNEEVLKRC 



PSPQREVLTV 
PTGEVADTPL 
SVVAGDLVLP 
PIPAGDGKVC 
PKKDAPRVMD 
YHSDVVGQVS 
PEVSTQPPPE 
PIASSGTYNL 
SKSTLSRSLS 

KPRPPSLKKK 
KTEGPSPALL 
VGRKTLPLTT 
QQENPPPTKK 
PIAKGTYTFD 
DPFKTSSKTP 
VKKSPKRSPL 
CMTVDLEADK 
SSLPQDDDAP 
ALVNTAAKNQ 
REAAHPTDVS 
TKEREVSEWK 
TVQQLVLEKE 
AQEYLSRVKK 



PEANSEPWTL 
EPGKVAGAAG 
GSCQDPACSD 
VSSPPEPDET 
KVTSDETRGA 
TDLIAQRSSD 
EPGCGSETVP 
DFDNIELVDT 
LQASDFDGAS 

QTTKKPTETP 
EETPLEPAVG 
APEAGEVTPS 
IGKKPVAKMP 
IDKWDDPNFN 
SSPSKSPASF 
SDPPSQDPTP 
QDYPQPSDLS 
KKQALYLMFD 
HPVPRGLAPN 
ISKTALYSRI 
DKYEESRREV 
QALADLNSVE 
EEQRYQALKV 



Searcher : Shears 308-4994 



09/627383 

1151 HAEEKLDRAN AEIAQVRGKA QQEQAAHQAS LRKEQLRVDA LERTLEQKNK 
1201 EIEELTKICD ELIAKMGKS 
HITS AT: 404-409 

REFERENCE . 1: 132:74 540 

L4 ANSWER 22 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 250313-43-0 REGISTRY 

CN Protein (Deinococcus radiodurans strain Rl gene DR1748) (9CI) (CA 

INDEX NAME) 
OTHER NAMES: 

CN GenBank AE002016-derived protein GI 6459525 
CI MAN 
SQL 162 

SEQ 1 MHPGPAILPF MDDLIQGRLG GADGYDIRCT IDGDKISGRA GGKLHGKDIE 

51 LEITDRGVRG SVGQESVNIE LQEGELRGNV GSQKLVLRGV DRVTGFMGEP 
101 IVGWNIVAQQ QGESLQGQLG STVLGRVFSL DLGSAPGWVG TLVAWAFYA 
151 LEPRASMGAQ AS 



HITS AT: 151-156 
REFERENCE 1: 131:347336 

L4 ANSWER 23 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 244613-54-5 REGISTRY 

CN Myosin (human clone BACMYOl /BACMY02 gene MY09A isoform IXA) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AF117888-derived protein GI 5732618 

CN Myosin IXA (human clone BACMYOl /BACMY02 gene MY09A) 

CI MAN 

SQL 2548 

SEQ 1 MNINDGGRRR FEDNEHTLRI YPGAISEGTI YCPIPARKNS TAAEVIESLI 

51 NKLHLDKTKC YVLAEVKEFG GEEWILNPTD CPVQQMMLWP RMALENRLSG 
101 EDYRFLLREK NLDGSIHYGS LQSWLRVTEE RRRMMERGFL PQPQQKDFDD 
151 LCSLPDLNEK TLLENLRDRF KHEKIYTYVG SILIVINPFK FLPIYNPKYV 
201 KMYDNHQLGK PEPHIYAVAD VAYHAMLQRK KNQCIVISGE SGSGKTQSTN 
251 FLIHHLTALS QKGFASGVEQ IILGAGPVLE AFGNAKTAHN NNSSRFGKFI 
301 QVNYQETGTV LGAYVEKYLL EKSRLVYQEH NERNYHVFYY LLAGASEDER 
351 SAFHLKQPEE YHYLNQITKK PLRQSWDDYC YDSEPDCFTV EGEDLRHDFE 
4 01 RLQLAMEMVG FLPKTRRQIF SLLSAILHLG NICYKKKTYR DDSIDICNPE 
451 VLPIVSELLE VKEEMLFEAL VTRKTVTVGE KLILPYKLAE AVTVRNSMAK 
501 SLYSALFDWI VFRINHALLN SKDLEHNTKT LSIGVLDIFG FEDYENNSFE 
551 QFCINFANER LQHYFNQHIF KLEQEEYRTE GISWHNIDYI DNTCCINLIS 
601 KKPTGLLHLL DEESNFPQAT NQTLLDKFKH QHEDNSYIEF PAVMEPAFII 
651 KHYAGKVKYG VKDFREKNTD HMRPDIVALL RSSKNAFISG MIGIDPVAVF 
701 RWAILRAFFR AMVAFREAGK RNIHRKTGHD DTAPCAILKS MDSFSFLQHP 
751 VHQRSLEILQ RCKEEKYSIT RKNPRTPLSD LQGMNALNEK NQHDTFDIAW 
801 NGRTGIRQSR LSSGTSLLDK DGIFANSTSS KLLERAHGIL TRNKNFKSKP 
851 ALPKHLLEVN SLKHLTRLTL QDRITKSLLH LHKKKKPPSI SAQFQASLSK 
901 LMETLGQAEP YFVKCIRSNA EKLPLRFSDV LVLRQLRYTG MLETVQIRQS 
951 GYSSKYSFQD FVSHFHVLLP RNIIPSKFNI QDFFRKINLN PDNYQVGKTM 
1001 VFLKEQERQH LQDLLHQEVL RRIILLQRWF RVLLCRQHFL HLRQASVIIQ 
1051 RFWRNYLNQK QVRDAAVQKD AFVMASAAAL LQASWRAHLE RQRYLELRAA 
1101 AIVIQQKWRD YYRRRHMAAI CIQARWKAYR ESKRYQEQRK KIILLQSTCR 
1151 GFRARQRFKA LKEQRLRETK PEVGLVNIKG YGSLEIQGSD PSEWEDCSFD 
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1201 

1251 

1301 

1351 

1401 

1451 

1501 

1551 

1601 

1651 

1701 

1751 

1801 

1851 

1901 

1951 

2001 

2051 

2101- 

2151 

2201 

2251 

2301 

2351 



NRIKAIEECK 
LVRERPRSLE 
SEDRRWSTEL 
EGDLQFPSPK 
SITCKPQLKD 
AGEALTLDIN 
QKQLQQQNEK 
KQRVERPSSL 
VTVFFERKGS 
QSYSHNSDDL 
AWKPVKLAGP 
IRPQRAKMRF 
LAAYHPTPPL 
QNDSVQIIAS 
RQNIFSFYSS 
RVWVNTFKVF 
IFKATQYSIP 
KYDPELSSRQ 
GSTNKIKELR 
YEEFLRAMGL 
TN RMS AN ALA 
YKARLKDISS 
PSVSDVSEET 
TFEMLVLEPR 



SVIESNRISR 
DLHQKKVGRA 
VPEGLQSPRG 
ISSSPKFDSR 
SFISNSLPTF 
RETRRYHCSG 
EMMEQIRQQT 
LSLNTSNKGE 
PCQSSTVKEL 
SREGNARPIF 
GQRETSQRFS 
WAKGKQGEKK 
SPELPGSCRK 
VSDLKSMDEF 
ALAMDDGKSI 
LDEYMNEFKT 
TYCEYCSSLI 
FGVELSRLTS 
QGLDTDAESV 
QERKETIRGV 
IVFAPCILRC 
LEFAENKAKT 
LTSEAAMETD 
ASDDETLESE 



ESSVDCLKES 
KRESRRMREL 
TPDSESSQGS 
DNALSASNET 
FYIPQQDPLK 
KDQIVPSLNT 
DILEKERKAF 
LNVLGSLSLK 
SKTDRMGTQL 
FTPKDNMSIP 
SVDEQAKLHK 
TTRVKPTTQS 
EFKENKEPSP 
LLKKVNDLDN 
RYKDLYALFE 
SDCTATKVPK 
WIMDRASVCK 
EDRTVPLVVE 
NLDDYNIHVI 
YSVIDQLSRT 
PDTTDPLQSV 
RLSLIRRSMG 
ITEQQQAAMQ 
ASIGTADSSE 



PNKQQERAQS 
EQAIFSLELL 
LELLSYEESQ 
SSAEHLKDGT 
TNSQLDTSIQ 
ESSNPVLKKL 
KTIEKPRIGE 
DAALAQKDSS 
NVACKLSNNR 
LVSKEALNSK 
TMSQGEITKL 
EVSPLFAGTD 
KAKRKRSVKI 
EDSKKDTLVD 
QILEKTMRLE 
TERKKRRKKE 
LCKYACHKKC 
KLINYIEMHG 
ASVFKQWLRD 
HLNTLERLIF 
QDISKTTTCV 
KGRIRRGNYP 
QEERVLTEQI 
NLNMESEYAI 



QSGVDLQEDV 
KVRSLGGISP 
KSKLESVISD 
MKEMWCSSE 
RNKLLENEDT 
EKLNTEKEER 
CLVAPSSYQS 
SAHLPPKDRP 
ISKREHFRPT 
NPQLHKEDEP 
AVRQKASDSD 
VIPAHQFPDE 
SNVALDSMHW 
WFKKALKEF 
QRDSLGESPV 
TDLVEEHNGH 
CLKTTAKCSK 
LYTEGIYRKS 
LPNPLMTFEL 
HLVRIALQED 
ELIVVEQMNK 
GPSSPVVVRL 
ENLQKEKEEL 
SEKSERSLAL 



2401 SSLKTAGKSE PSSKLRKQLK KQQDSLDVVD SSVSSLCLSN TASSHGTRKL 

2451 FQIYSKSPFY RAASGNEALG MEGPLGQTKF LEDKPQFISR GTFNPEKGKQ 

2501 KLKNVKNSPQ KTKETPEGTV MSGRRKTVDP DCTSNQQLAL FGNNEFMV 

HITS AT: 2357-2362 



REFERENCE 



131:253834 



L4 
RN 
CN 
CI 
SQL 

SEQ 



ANSWER 24 OF 28 REGISTRY COPYRIGHT 2002 ACS 
222964-43-4 REGISTRY 

Myosin IXa (human clone BAC) (9CI) {CA INDEX NAME) 

MAN 

2548 



1 MNINDGGRRR 

51 NKLHLDKTKC 

101 EDYRFLLREK 

151 LCSLPDLNEK 

201 KMYDNHQLGK 

251 FLIHHLTALS 

301 QVNYQETGTV 

351 SAFHLKQPEE 

4 01 RLQLAMEMVG 

451 VLPIVSELLE 

501 SLYSALFDWI 

551 QFCINFANER 

601 KKPTGLLHLL 

651 KHYAGKVKYG 

701 RWAILRAFFR 

751 VHQRSLEILQ 

801 NGRTGIRQSR 

851 ALPKHLLEVN 

901 LMETLGQAEP 

951 GYSSKYSFQD 



FEDNEHTLRI 
YVLAEVKEFG 
NLDGSIHYGS 
TLLENLRDRF 
PEPHIYAVAD 
QKGFASGVEQ 
LGAYVEKYLL 
YHYLNQITKK 
FLPKTRRQIF 
VKEEMLFEAL 
VFRINHALLN 
LQHYFNQHIF 
DEESNFPQAT 
VKDFREKNTD 
AMVAFREAGK 
RCKEEKYSIT 
LSSGTSLLDK 
SLKHLTRLTL 
YFVKCIRSNA 
FVSHFHVLLP 



YPGAISEGTI 
GEEWILNPTD 
LQSWLRVTEE 
KHEKIYTYVG 
VAYHAMLQRK 
IILGAGPVLE 
EKSRLVYQEH 
PLRQSWDDYC 
SLLSAILHLG 
VTRKTVTVGE 
SKDLEHNTKT 
KLEQEEYRTE 
NQTLLDKFKH 
HMRPDIVALL 
RNIHRKTGHD 
RKNPRTPLSD 
DGIFANSTSS 
QDRITKSLLH 
EKLPLRFSDV 
RNIIPSKFNI 



YCPIPARKNS 
CPVQQMMLWP 
RRRMMERGFL 
SILIVINPFK 
KNQCIVISGE 
AFGNAKTAHN 
NERNYHVFYY 
YDSEPDCFTV 
NICYKKKTYR 
KLILPYKLAE 
LSIGVLDIFG 
GISWHNIDYI 
QHEDNSYIEF 
RSSKNAFISG 
DTAPCAILKS 
LQGMNALNEK 
KLLERAHGIL 
LHKKKKPPSI 
LVLRQLRYTG 
QDFFRKINLN 



TAAEVIESLI 
RMALENRISG 
PQPQQKDFDD 
FLPIYNPKYV 
SGSGKTQSTN 
NNSSRFGKFI 
LLAGASEDER 
EGEDLRHDFE 
DDSIDICNPE 
AVTVRNSMAK 
FEDYENNSFE 
DNTCCINLIS 
PAVMEPAFII 
MIGIDPVAVF 
MDSFSFLQHP 
NQHDTFDIAW 
TRNKNFKSKP 
SAQFQASLSK 
MLETVQIRQS 
PDNYQVGKTM 
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1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 



VFLKEQERQH 
RFWRNYLNQK 
AIVIQQKWRD 
GFRARQRFKA 
NRIKAIEECK 
LVRERPRSLE 
SEDRRWSTEL 
EGDLQFPSPK 
SITCKPQLKD 
AGEALTLDIN 
QKQLQQQNEK 
KQRVERPSSL 
VTVFFERKGS 
QSYSHNSDDL 
AWKPVKLAGP 
IRPQRAKMRF 
LAAYHPTPPL 
QNDSVQIIAS 
RQNIFSFYSS 
RVWVNTFKVF 
IFKATQYSIP 
KYDPELSSRQ 
GSTNKIKELR 
YEEFLRAMGL 
TNRMSANALA 
YKARLKDISS 
PSVSDVSEET 
TFEMLVLEPR 



LQDLLHQEVL 
QVRDAAVQKD 
YYRRRHMAAI 
LKEQRIRETK 
SVIESNRISR 
DLHQKKVGRA 
VPEGLQSPRG 
ISSSPKFDSR 
SFISNSLPTF 
RETRRYHCSG 
EMMEQIRQQT 
LSLNTSNKGE 
PCQSSTVKEL 
SREGNARPIF 
GQRETSQRFS 
WAKGKQGEKK 
SPELPGSCRK 
VSDLKSMDEF 
ALAMDDGKSI 
LDEYMNEFKT 
TYCEYCSSLI 
FGVELSRLTS 
QGLDTDAESV 
QERKETIRGV 
IVFAPCILRC 
LEFAENKAKT 
LTSEAAMETD 
ASDDETLESE 



RRIILLQRWF 
AFVMASAAAL 
CIQARWKAYR 
PEVGLVNIKG 
ESSVDCLKES 
KRESRRMREL 
TPDSESSQGS 
DNALSASNET 
FYIPQQDPLK 
KDQIVPSLNT 
DILEKERKAF 
LNVLGSLSLK 
SKTDRMGTQL 
FTPKDNMSIP 
SVDEQAKLHK 
TTRVKPTTQS 
EFKENKEPSP 
LLKKVNDLDN 
RYKDLYALFE 
SDCTATKVPK 
WIMDRASVCK 
EDRTVPLVVE 
NLDDYNIHVI 
YSVIDQLSRT 
PDTTDPLQSV 
RLSLIRRSMG 
ITEQQQAAMQ 
ASIGTADSSE 



RVLLCRQHFL 
LQASWRAHLE 
ESKRYQEQRK 
YGSLEIQGSD 
PNKQQERAQS 
EQAIFSLELL 
LELLSYEESQ 
SSAEHLKDGT 
TNSQLDTSIQ 
ESSNPVLKKL 
KTIEKPRIGE 
DAALAQKDSS 
NVACKLSNNR 
LVSKEALNSK 
TMSQGEITKL 
EVSPLFAGTD 
KAKRKRSVKI 
EDSKKDTLVD 
QILEKTMRLE 
TERKKRRKKE 
LCKYACHKKC 
KLINYIEMHG 
ASVFKQWLRD 
HLNTLERLIF 
QDISKTTTCV 
KGRIRRGNYP 
QEERVLTEQI 
NLNMESEYAI 



HLRQASVIIQ 
RQRYLELRAA 
KIILLQSTCR 
PSEWEDCSFD 
QSGVDLQEDV 
KVRSLGGISP 
KSKLESVISD 
MKEMWCSSE 
RNKLLENEDT 
EKLNTEKEER 
CLVAPSSYQS 
SAHLPPKDRP 
ISKREHFRPT 
NPQLHKEDEP 
AVRQKASDSD 
VIPAHQFPDE 
SNVALDSMHW 
WFKKALKEF 
QRDSLGESPV 
TDLVEEHNGH 
CLKTTAKCSK 
LYTEGIYRKS 
LPNPLMTFEL 
HLVRIALQED 
ELIWEQMNK 
GPSSPVWRL 
ENLQKEKEEL 
SEKSERSLAL 



2401 SSLKTAGKSE PSSKLRKQLK KQQDSLDVVD SSVSSLCLSN TASSHGTRKL 

2451 FQIYSKSPFY RAASGNEALG MEGPLGQTKF LEDKPQFISR GTFNPEKGKQ 

2501 KLKNVKNSPQ KTKETPEGTV MSGRRKTVDP DCTSNQQLAL FGNNEFMV 

HITS AT: 2357-2362 

REFERENCE 1: 130:292458 



L4 ANSWER 25 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 221651-87-2 REGISTRY 

CN GTPase-activating protein Myr-7 (Rattus norvegicus gene myo9a Rho 

protein-specific) (9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN GenBank AJ0017 13-derived protein GI 3955026 
CN Myosin-RhoGAP protein Myr 7 (rat gene myo9a) 
CI MAN 
SQL 2626 

SEQ 1 MNVSDGGRRR FEDNEHTLRI YPGTISEGTI YCPIPARKNS TAAEVIDSLI 

51 NRLHLDKTKC YVLAEVKEFG GEEWILNPTD CPVQRMMLWP RMALENRLSG 
101 EDYRFLLREK NLDGSIHYGS LQSWLRVTEE RRRMMERGFL PQPQQKDFDD 
151 LCSLPDLNEK TLLENLRNRF KHEKIYTYVG SILIAINPFK FLPIYNPKYV 
201 KMYDNHQLGK LEPHIYAVAD VAYHAMLQRK KNQCIVISGE SGSGKTQSTN 
251 FLIHHLTALS QKGFASGVEQ IILGAGPVLE AFGNAKTAHN NNSSRFGKFI 
301 QVNYQETGTV LGAYVEKYLL EKSRLVYQEH NERNYHVFYY LLAGASEEER 
351 LAFHLKQPEE YHFLNQITKK PLRQSWDDYC YDSEPDCFTV EGEDLRHDFE 
401 RLQLAMEMVG FLPKTRRQIF SLLSAILHLG NISYKKKTYR DDSIDICNPE 
4 51 VLPIVSELLE VKEEMLFEAL VTRKTVTVGE KLILPYKLAE AVTVRNSMAK 
501 SLYSALFDWI VFRINHALLN SKDLEKDTKT LSIGVLDIFG FEDYENNSFE 
551 QFCINFANER LQHYFNQHIF KLEQEEYRTE GISWHNIDYI DNTCCINLIS 
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601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 



KKPTGLLHLL 
KHYAGKVKYG 
RWAVLRAFFR 
VHQRSLEILQ 
NVRTGIRQSR 
PVLPKHLLEV 
KLMETLDQAE 
SGYSSKYSFQ 
MVFLKEHERQ 
QRFWRNYLNQ 
AAVIIQQRWR 
RGFRARQRYK 
ANRVKAIEEC 
VIVRQRPKSL 
PSEERRWSTE 
DDGELKISSP 
TAEEKLVYSS 
QRNKLPERET 
LNIEKEKRQK 
LAPSFYQSRQ 
KDRPVTLFFE 
FKSTHSYSHR 
WKSKLAGPGQ 
DIIEPLLSLD 
MSQGEITKLA 
VSSLFAGSDV 
AKRKRGVKIS 
DSKKDTLVDV 
ILEKTMRFEQ 
RKKRRKKETD 
KYACHKKCCL 
INYIEMHGLY 
VFKQWLRDLP 
STLERLIFHL 
ISKTTTCVEL 
LIAVRFMSIT 
TAMDTDVTDQ 



DEESNFPQAT 
VKDFREKNTD 
AWAFREAGK 
RCKEEKYSIT 
LPTNNTSLLD 
NSLKHLTRLT 
PYFVKCIRSN 
DFVSHFHVLL 
HLQDLLHQEV 
KQVRNAAVEK 
ELCRRRHRAA 
ALKEERLKET 
KSVIESNRIS 
EDLHQKKVGR 
LMPEGLQSPQ 
STFTNPKFDS 
QPITCKSQLR 
TLKTTLTLDI 
QLQQQNEKEM 
KVERPSSLHI 
RKGSPCQSRT 
SDDPSREGSS 
REVARPAHKK 
QASHSELGPV 
GRQKSSDLDI 
TPVHPFSDEL 
SVALDSMHWQ 
VFKKALKEFR 
RDWNESPVRV 
LVEEHNGHMF 
KTTAKCSKKY 
TEGIYRKSGS 
NPLMTFELYE 
VRIALQEDTN 
IWEQMNKYK 
RSSVSGKGRL 
QQAAMQQEEK 



NQTLLDKFKH 
HMRPDIVALL 
RHIQRKSGHD 
RKNPRTPLSD 
KDGIFANSAS 
LQDRITKSLL 
AEKLPLRFSD 
PQHIIPSKFN 
LRRIILLQRW 
DAFIMASAAS 
TCIQSRWRGY 
KLEHGLAQIK 
RESSMDFSKE 
AKRESRRMRE 
GTPDSESSQG 
QNNALSASSE 
DSFVSSSLPT 
NREARKCQFS 
MEQIRQQTDI 
QNTPSKGEAG 
VKELTKTERM 
RPIFFTPKDN 
KARMARTRSD 
SSLGQASHSD 
RPQRAKMRFW 
TQYHPTPPLS 
NDSVQIIASA 
QNIFSSYSSA 
WVNTFKVFLD 
KATQYSIPTY 
DPELSSRQFG 
TNKIKELRQG 
EFLRAMGLQE 
RMSANALAIV 
ARLKDISSLE 
HRGSHPNPSS 
VLTEQIENLQ 



QHEENSYIEF 
RSSRNAFVSG 
DTTPCTILKS 
LQGMNTLNEK 
SKLLERAHGI 
HLHKKKKPPS 
ALVLRQLRYT 
IQDFFRKINI 
FRVLLSRQQF 
LLQASWRAHL 
RQSKKYKEQR 
TCGPLEIQGS 
SPDKQQERGR 
LEQAIFSLEL 
SLELLTCDEN 
TSSTFSGKGA 
FFYIPHQEPL 
GQVTPLNPDS 
LEKERKAFKT 
VLGSPSALAT 
GTQHDAACRL 
VITPLVHSGN 
FLTRGTFADG 
SEMTSQRFSS 
AKGKQGEKKT 
PELPGSCRKE 
NDLKSMDEFL 
LAMDDGKSIR 
EYMNEFKTLD 
CEYCSSLIWI 
VELSRLTSED 
LDTDAESVNL 
RKETIRGVYS 
FAPCILRCPD 
FAENKAKTRL 
PVIVRLPSMS 
KEKEELTFEM 



PAVMEPAFII 
MTGIDPVAVF 
MDSFSFLQHP 
NQHDTFDIAW 
LTRNKNFRSK 
ISAQFQVSLS 
GMLETVRIRQ 
NPDNYQVGKT 
LHLRQASVII 
ERQRYLELRA 
NKIILLQSIY 
DPSEWEDRSF 
SQSGTDLQGD 
LKVRSLGGMS 
QKSKPESLIL 
SSDSEHLKNG 
KTSSQLDTSI 
SCTVLKKLEK 
IEQSRTEASL 
KDSPSIHLPP 
SNNHNTEREH 
PQVHKQDEPA 
EGDTEEDDYD 
VDEQARLHKA 
TRVKPAPQSE 
FKENKEPSPK 
LKKMNDLDNE 
YKDLYALFEQ 
STAPKVLKTE 
MDRASVCKLC 
RAVPLVVEKL 
DDYNIHVIAS 
VIDQLSRTHL 
TTDPLQSVQD 
SLIRRSMKPV 
DVPEETLTSE 
LVLEPRASDD 



2451 EALESEASIG TADSSENLNM DPEERSLALS SLKAAGKSEP SSKFRKQLRK 

2501 QPDSLDSVSS SVSSCLSNTT SSHGTRKRFQ IYSKSPFYRA ASACEAQGME 

2551 GPLGQAKSLE DRPQFISRGT FNPEKGKQKL KNVKNS PQKT KETPEGTVSS 

2 601 GRKKTVDSDC SSTQQLPLFG NNEFMV 

HITS AT: 2443-2448 

REFERENCE 1: 130:265212 



L4 ANSWER 26 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 218778-68-8 REGISTRY 

CN 178-273-Protein Zneul (human neuro-growth factor-like fragment) 

(9CI) (CA INDEX NAME) 
OTHER NAMES: 

CN Protein Zneul (human neuro-growth factor-like HSMHC3W5A-like (HSM2) 

domain) 
CI MAN 
SQL 256 

SEQ 1 PRLYSGLYGL YPRPRARGVA LALAPRASNP RTHRGLYVAL ASPSERALAM 

51 ETLYSGLGLV ALGLNARGLE GLNSERARGV ALASPLELEG LGLLYSLEGL 



Searcher : Shears 308-4994 



09/627383 



101 NLEVALLEAL APRLEHISSE RLE ALAS ERG LNALALEGLH ISGLYLEPRA 

151 SPPRGLYSER LELEVALHIS SERPHEGLNG LNLEGLYARG ILEASPSERL 

201 ESERGLGLNI LESERPHELE GLGLGLNLEG LYSERCYSSE RCYSLYSLYS 

251 ASPSER 

HITS AT: 146-151 



REFERENCE 



130:77730 



L4 ANSWER 27 OF 28 REGISTRY COPYRIGHT 2002 ACS 
RN 218778-65-5 REGISTRY 

CN 20-104-Protein Zneul (human neuro-growth factor-like fragment) (9CI) 

(CA INDEX NAME) 
OTHER NAMES: 

CN Protein Zneul (human neuro-growth factor-like HSMHC3W5A-like (HSM1) 

domain) 
CI MAN 
SQL 708 



SEQ 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



THRGLHISAL 
LYASPPRVAL 
HRCYSASPGL 
ALATYRARGA 
PRGLYTRPLY 
SGLNPRPRCY 
YSPRALAGLY 
RALAARGARG 
RTRPCYSGLN 
ALPRLYSGLY 
AMETLYSGLG 
GLNLEVALLE 



ATYRARGPRG 
SERGLSERPH 
YHISARGALA 
RGSERPRGLY 
SARGTHRSER 
SARGASNGLY 
TRPARGGLYA 
GLYGLYCYSP 
CYSTRPGLGL 
GLYPRPRARG 
LVALGLNARG 
ALAPRLEHIS 



LYARGARGVA 
EVALGLNARG 
CYSSERTHRT 
LEALAPRALA 
GLYLEPRGLY 
GLYSERCYSV 
SPTHRCYSGL 
RGLNARGCYS 
YHISSERLES 
VALALAPRAS 
LEGLNSERAR 
SERLEALASE 



LCYSALAVAL 
VALTYRGLNP 
YRARGTHRIL 
ARGPRARGTY 
ALACYSGLYA 
ALGLNPRGLY 
NSERASPVAL 
VALASNTHRA 
ERALAASPGL 
NPRTHRGLYV 
GVALASPLEL 
RGLNALALEG 



ARGALAHISG 
RPHELETHRT 
ETYRARGTHR 
RALACYSCYS 
LAALAILECY 
ARGCYSARGC 
ASPGLCYSSE 
LAGLYSERTY 
YTHRLECYSV 
ALASPSERAL 
EGLGLLYSLE 
LHISGLYLEP 



601 RASPPRGLYS ERLELEVALH ISSERPHEGL NGLNLEGLYA RGILEASPSE 

651 RLESERGLGL NILESERPHE LEGLGLGLNL EGLYSERCYS SERCYSLYSL 

701 YSASPSER 

HITS AT: 598-603 



REFERENCE 



130:77730 



L4 
RN 
CN 

CI 
SQL 

SEQ 
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178535-96-1 REGISTRY 

Merlin (Drosophila melanogaster moesin-ezrin-radixin-like) 
(CA INDEX NAME) 
MAN 
636 



(9CI) 



1 MSPFGSKKNR SLSVRVSTFD SELEFKLEPR ASGQDLFDLV CRTIGLRESW 



51 YFGLQYVDTR 
101 EELIQEITQH 
151 TYKDGMLAGG 
201 YLKIAQDLDM 
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Why do folding mutations decrease the 
thermosensitivity of Green 
Fluorescent Proteins? 

Zimmer, Marc; Fedeles, Flavia 

Chemistry, Connecticut College, New London, CT, 
06320, USA 

Abstracts of Papers, 223rd ACS National Meeting, 
Orlando, FL, United States, April 7-11, 2002 
(2002), CHED-401. American Chemical Society: 
Washington, D. C. 
CODEN: 69CKQP 

Conference; Meeting Abstract 
English 

Green Fluorescent Protein is a 

spontaneously fluorescent sol. globular protein isolated from 

jellyfish (Aequorea victoria) . The efficiency with which newly 

synthesized GFP folds into the fluorescent active form is temp. 

dependent, which causes difficulty in the use of GFP. Recently it 

was shown that mutation of the serine residue at 

position 147 to a proline results in a redn. of 

the temp, sensitivity of GFP. The mutated variant is able 

to efficiently maturate at temps, as high as 37 . degree. C. We have 

attempted to det . through computational methods whether the lower 

thermosensitivity of the S147P mutant is due to a better 

folding of the b-sheets in the b barrel of GFP and therefore to a 

tighter folding of the residues around the chromophore region of 

GFP. Our results show that there is no significant difference 

between the folding pattern of the b-barrel in the wild-type and 

mutant GFP but the chromophore region is more tightly 

preorganized for autocatalytic cyclization in the S147P 

mutant. 
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PRIORITY APPLN. INFO.: CA 1999-2291968 A 19991222 

CA 2000-2314861 A 20000802 

AB The present invention provides a bioluminescence resonance energy 
transfer (BRET) detection system characterized by a broad spectral 
resoln. between donor and acceptor emission wavelengths. The broad 
spectral resoln. between the emission wavelength of the 
bioluminescent donor protein and the fluorescent acceptor mol. 
results in an increased signal-to-base ratio and dynamic range in 
comparison with a basic BRET system. A BRET apoptosis sensor was 
prepd. by . recombinant ly prepg. mutant green 
fluorescent protein GFP1 fused with a linker 
peptide contg. a caspase-3 cleavage site and fused with 
mutant Renilla luciferase (Rluc) . Upon induction of 
apoptosis, caspase-3 cleaves the linker, sepg. the GFP1 from Rluc 
causing the BRET ratio to decrease over time. 

REFERENCE COUNT: 3 THERE ARE 3 CITED REFERENCES AVAILABLE FOR 

THIS RECORD. ALL CITATIONS AVAILABLE IN 
THE RE FORMAT 



L8 ANSWER 3 OF 
ACCESSION NUMBER: 
DOCUMENT NUMBER: 
TITLE: 



AUTHOR (S) : 
CORPORATE SOURCE: 



SOURCE : 



HCAPLUS COPYRIGHT 2002 ACS 
1999:324955 HCAPLUS 
131:155431 

S147P green fluorescent 
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Kimata, Yukio; Lim, Chun Ren; Kohno, Kenji 
Research and Education Center for Genetic 
Information, Nara Institute of Science and 
Technology, Nara, 630-01, Japan 
Methods in Enzymology (1999), 302 (Green 
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PUBLISHER: Academic Press 

DOCUMENT TYPE: Journal 

LANGUAGE: English 

AB A new green fluorescent protein ( 

GFP) variant is described in which serine- 

147 is mutated to proline. The S147P 

mutation alters the maturation efficiency of GFP at 

37. degree, and causes a 5-nm shift in the peak of excitation. This 
novel mutation may be useful to enhance the fluorescence 
properties of other GFP variants at 37. degree.. (c) 1999 Academic 
Pre s s 
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129:241552 

Recombinant preparation of green- 
fluorescent protein 
mutant of Aequorea victoria 

Kono, Kenji; Takeda, Katsuo; Hasegawa, Mamoru 

Dinabeck Laboratory K. K. , Japan 

Jpn. Kokai Tokkyo Koho, 11 pp. 

CODEN : JKXXAF 

Patent 

Japanese 
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JP 1997-62370 



JP 10234382 A2 19980908 

AB A green-fluorescent protein 

mutant having substitution mutations at 65- 

Ser. fwdarw.Thr and 147-Ser . f wdarw. 

Pro is prepd. by expression of the mutagenized 

gene in transgenic host cells. The mutant is able to 

generate fluorescence at high temp. (37 .degree .) . 
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fluorescent protein at high 
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Kimata, Yukio; Iwaki, Masaharu; Lim, Chun Ren; 
Kohno, Kenji 

Research and Education Center for Genetic 

Information, Nara Institute of Science and 

Technology, Nara, 630-01, Japan 

Biochem. Biophys. Res. Commun. (1997), 232(1), 
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CODEN: BBRCA9; ISSN: 0006-291X 

Academic 

Journal 
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AB Green fluorescent protein (GFP 

) from Aequorea victoria is widely used as a marker of gene 
expression and protein localization in living cells from prokaryotes 
to eukaryotes. However, the total fluorescent signal from wild-type 
GFP is very weak when expressed in cells cultured at 37. degree, 
compared to 30. degree, or below. This characteristic makes GFP 
poorly suited to use as a marker in mammalian cells. Here the 
authors describe a new variant of GFP which carries a substitution 
of Serl47 to Pro (S147P GFP) and which emits a 

stronger fluorescent signal than the wild-type GFP at high temp. 
When S147P is combined with the Ser65 to Thr mutation 
(S65T GFP) , the resulting double mutant emits fluorescence 
which is several-fold stronger than GFP with a single S65T 
modification in both bacterial or mammalian cells. This S147P 
mutation should be useful for constructing new GFP variants 
which stably emit strong fluorescence at a wide range of culturing 
temps . 
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TITLE: Functional correlation between the nuclear 

localization of Fhtlp and its flocculation and 
heat tolerance activities in budding yeast 
Saccharomyces cerevisiae 

Iha, Hidekatsu; Tezuka, Hideo; Yaguchi, So-ichi; 
Tsurugi, Kunio 

Department of Biochemistry, Yamanashi Medical 
University, Yamanashi, Japan 
Journal of Biomedical Science (Basel, 
Switzerland) (2001), 8(5), 416-420 
CODEN: JBCIEA; ISSN: 1021-7770 
S. Karger AG 
Journal 
English 

Fhtlp is involved in the flocculation and heat tolerance machinery 
of budding yeast Saccharomyces cerevisiae. Despite knowledge of its 
involvement in those phenotypes, a precise mechanism has yet to be 
discovered. To this end, the authors monitored the relationship 
between subcellular localization of Fhtlp and its flocculation or 
heat tolerance function using newly developed expression vectors 
with a recombinant green fluorescent 
protein (GFP; S65T/S147P) of Aequorea 

victoria added at both the N- and C-terminus of Fhtlp. The main 
fluorescent signal of the GFP tagged with either a wild-type Fhtlp 
or mutants which preserve their flocculation function was 
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detected in the nucleus, whereas signals of functionless 
mutants were dispersed to the cytoplasm. 
REFERENCE COUNT: 25 THERE ARE 25 CITED REFERENCES AVAILABLE 

FOR THIS RECORD. ALL CITATIONS AVAILABLE 

IN THE RE FORMAT 



L10 ANSWER 2 OF 3 HCAPLUS COPYRIGHT 2002 ACS 
ACCESSION NUMBER: 2000:595996 HCAPLUS 

DOCUMENT NUMBER: 133:263439 

TITLE: A flexible single-step detection of blotted 

antigen using a fusion protein between protein A 

and green fluorescent 

protein 

Aoki, Takashi; Miyashita, Mamiko; Fujino, 
Hiroyoshi; Watabe, Hiroyuki 
Department of Biochemistry, Faculty of 
Pharmaceutical Sciences, Health Sciences 
University of Hokkaido, Hokkaido, 061-0293, 
Japan 

Bioscience, Biotechnology, and Biochemistry 
(2000), 64(7), 1547-1551 
CODEN: BBBIEJ; ISSN: 0916-8451 

Japan Society for Bioscience, Biotechnology, and 
Agrochemistry 
Journal 
English 

A green fluorescent protein 
mutant (S147P GFP) was fused with 

protein A and expressed in Escherichia coli. This fusion protein 
(PA-GFP14 7 ) was used in immunoblotting studies as a new detection 
system, designated as "flexible single-step detection (FSSD)" . In 
FSSD, the detection of blotted antigen was done in one step, and the 
antigen-antibody reaction can be monitored by UV-irradn. in real 
time. The reaction time, therefore, can be flexibly controlled by 
monitoring the green fluorescence. 
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Imaging Cells in the Developing Nervous System 
with Retrovirus Expressing Modified 
Green Fluorescent 
Protein 

Okada, Ami; Lansford, Rusty; Weimann, James M.; 
Fraser, Scott E.; McConnell, Susan K. 
Department of Biological Sciences, Stanford 
University, Stanford, CA, 94305, USA 
Experimental Neurology (1999), 156(2), 394-406 
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Academic Press 
Journal 
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e movements of cells and their processes in 
ebrates, we constructed replication-incompetent 
ors encoding green fluorescent 
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protein (GFP) that can be detected as a single 
integrated copy per cell. To optimize GFP expression, the CMV 
enhancer and avian .beta.-actin promoter were incorporated within a 
retrovirus construct to drive transcription of red shifted (F64L, 
S65T) and codon-modif ied GFP (EGFP), EGFP tagged with GAP-43 
sequences targeting the GFP to the cell membrane, or EGFP with 
addnl. mutations that increase its ability to fold 
properly at 37. degree. (S147P or V163A, S175G) . We have 
used these viruses to efficiently mark and follow the developmental 
progression of a large population of cells in rat neocortex and 
whole avian embryos. In the chick embryo, the migration and 
development of GFP-marked neural crest cells were monitored using 
time-lapse videomicroscopy . In the neocortex, GFP clearly 
delineates the morphol. of a variety of neuronal and glial 
phenotypes. Cells expressing GFP display normal dendritic 
morphologies, and infected cells persist into adulthood. Cortical 
neurons appear to form normal local axonal and long-distance 
projections, suggesting that the presence of cytoplasmic or 
GAP-43-tagged GFP does not significantly interfere with normal 
development. (c) 1999 Academic Press. 
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Application of green fluorescent 
protein to affinity electrophoresis; affinity 
of IgG-binding domain C from streptococcal protein G 
to mouse IgGl. 

Kazama, Hitoshi (1); Yamada, Keiko; Aoki, Takashi; 
Watabe, Hiroyuki 

(1) Department of Biochemistry, Faculty of 
Pharmaceutical Sciences, Health Sciences University 
of Hokkaido, 1757 Kanazawa, Ishikari-Tobetsu, 
Hokkaido, 061-0293: kazama@hoku-iryo-u.ac.jp Japan 
Biological & Pharmaceutical Bulletin, (February, 
2002) Vol. 25, No. 2, pp. 168-171. print. 
ISSN: 0918-6158. 
Article 
English 

Affinity electrophoresis (AEP) using green 
fluorescent protein (GFP) was studied. 

We constructed a fusion protein that linked S147PGFP and IgG binding 
domain C from streptococcal protein G (GFP-SpGC) . The 
affinity of GFP-SpGC for mouse IgGl was measured. The AEP 
using GFP does not require a staining step after 
electrophoresis, and was successful with a non-purified sample. 
Therefore, this method is simple and useful for measuring many 
samples such as those used in mutational studies. 
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Bioluminescence resonance energy transfer system 
useful to monitor enzyme activity, comprises 
bioluminescent donor protein attached to first 
molecule and fluorescent acceptor molecule attached 
to second molecule. 
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(BIOS-N) BIOSIGNAL PACKARD INC 
94 



PATENT NO 



KIND DATE 



WEEK 



LA 



PG 



WO 2001046691 Al 20010628 (200143)* EN 132 



RW: AT 


BE 


CH 


CY 


DE 


DK 


EA 


ES 


FI 


FR 


GB 


GH 


GM 


GR 


IE 


IT 


KE 


LS 


LU 


MC 


MW 


MZ 


NL 


OA 


PT 


SD 


SE 


SL 


SZ 


TR 


TZ 


UG 


ZW 
















W: AE 


AG 


AL 


AM 


AT 


AU 


AZ 


BA 


BB 


BG 


BR 


BY 


BZ 


CA 


CH 


CN 


CR 


CU 


CZ 


DE 


DK 


DM 


DZ 


EE 


ES 


FI 


GB 


GD 


GE 


GH 


GM 


HR 


HU 


ID 


IL 


IN 


IS 


JP 


KE 


KG 


KP 


KR 


KZ 


LC 


LK 


LR 


LS 


LT 


LU 


LV 


MA 


MD 


MG 


MK 


MN 


MW 


MX 


MZ 


NO 


NZ 


PL 


PT 


RO 


RU 


SD 


SE 


SG 


SI 


SK 


SL 


TJ 


TM 


TR 


TT 


TZ 


UA 


UG 


US 


UZ 


VN 


YU 


ZA 


zw 





































AU 2001023348 A 20010703 (200164) 

APPLICATION DETAILS: 

PATENT NO KIND APPLICATION DATE 
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PRIORITY APPLN. INFO: CA 2000-2314861 20000802; CA 1999-2291968 . 



AN 2001-408713 [43] WPIDS 

AB WO 200146691 A UPAB: 20010801 

NOVELTY - A bioluminescence resonance energy transfer (BRET) system 
(I) comprising a bioluminescent donor protein (BDP) attached to a 
first molecule or modulator, and a fluorescent' acceptor molecule 
(FAM) attached to a second molecule or modulator, where FAM can 
accept the energy from the BDP when they are associated, in the 
presence of the appropriate substrate. 

DETAILED DESCRIPTION - In (I), a physical change in the 
modulator (s) influences the energy transfer efficiency between the 
BDP and the FAM, and (I) has a broad spectral resolution of at least 
80 nm between the peaks of BDP and FAM emission spectra. 

An INDEPENDENT CLAIM is also included for production of (I) . 

USE - (I) is useful to monitor protein-protein interactions or 
enzyme activity in vitro or in vivo (claimed) . (I) is useful for in 
vitro and in vivo detection methods, and in assays to detect 
molecular changes in a wide variety of applications such as drug 
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discovery, analyte screening, second messenger screening, drug 
screening, diagnosis, genotoxicity, identification of gene function, 
gene discovery, and proteomics. (I) is useful to study receptor 
iimerization/multimerization, to characterize orphan receptors, to 
identify ligands for orphan receptors, for detecting interaction 
which occur as a consequence of receptor signaling, and to study the 
effect of additional molecule on receptor function. 

ADVANTAGE - (I) provides an improved signal-to-base ratio (S/B) 
and dynamic range (DR) over a basic BRET system. (I) is readily 
adaptable to methods of automation and high throughput screening. 
Dwg.0/34 
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New affinity fluorescent protein comprising a 
modified fluorescent protein having a heterologous 
amino acid sequence and a ligand-activated protein 
binding site, for detecting target ligand in a 
mixture of macromolecules or in a cell. 
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20 



PATENT NO KIND DATE WEEK LA PG 



WO 2001009177 A2 20010208 (200116)* EN 44 

RW: AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE 
W: CA JP 

APPLICATION DETAILS: 

PATENT NO KIND APPLICATION DATE 



WO 2001009177 A2 WO 2000-US20619 20000728 



L12 ANSWER 3 OF 9 
ACCESSION NUMBER: 
DOC. NO. CPI: 
TITLE: 



DERWENT CLASS: 
INVENTOR (S) : 
PATENT ASSIGNEE (S) : 
COUNTRY COUNT: 
PATENT INFORMATION: 



PRIORITY APPLN. INFO: US 1999-146438P 19990729 

AN 2001-159852 [16] WPIDS 

AB WO 200109177 A UPAB: 20010323 

NOVELTY - An affinity fluorescent protein (aFP) comprising a 
modified fluorescent protein molecule with a mutated 
fluorescent protein molecule and a heterologous amino acid sequence 
having a ligand-activated protein binding site, is new. 

DETAILED DESCRIPTION - A new affinity fluorescent protein (aFP) 
comprises a modified fluorescent protein molecule with a 
mutated fluorescent protein molecule and a heterologous 
amino acid sequence having a ligand-activated protein binding site. 
The modified fluorescent protein molecule displays an altered 
spectral property when the binding site is engaged with ligand 
relative to the spectral property displayed when the binding site is 
not engaged by ligand. 

INDEPENDENT CLAIMS are also included for the following: 
(1) an aFP expression cassette or expression vector comprising 
a modified green fluorescent protein ( 
GFP) nucleic acid sequence which is mutated and 
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operatively linked to expression control sequences, where the 

modified GFP sequence comprises a recombinant peptide 

having restriction endonuclease sites introduced at a location of 

the GFP molecule selected from between Gin 157 and Lys 

158, Glu 172 and Asp 173, or both locations; 

(2) a host cell comprising a recombinant nucleic acid having 
expression control sequences operatively linked to a nucleotide . 
sequence encoding an aFP, which has a modified or mutated 

GFP molecule and a heterologous amino acid sequence functioning as a 
ligand-activated protein binding site, and which displays an^altered 
spectral property when the binding site is engaged with ligand 
relative to the spectral property displayed when the binding site is 
not engaged by ligand; 

(3) detecting the presence of a target ligand in a mixture of 
macromolecules by: 

(a) contacting a sample with an aFP comprising a binding site 
for the target ligand; 

(b) exciting the aFP with light; and 

(c) measuring the fluorescent property that differs as a result 
of ligand activation of the aFP; and 

(4) detecting the occurrence of a target ligand in a cell by: 

(a) introducing into the cell an aFP comprising a binding site 
for the target ligand; 

(b) exciting the aFP present in the cell with light; and 

(c) detecting the fluorescence pattern due to ligand activation 
of the affinity fluorescent protein in the cell and comparing it to 
the fluorescence pattern in a control cell, where the fluorescence 
pattern determines the occurrence of the target ligand in the cell. 

USE - The aFP is useful for: 

(i) detecting target ligand in a mixture of macromolecules or 
in a cell; 

<ii) for detecting and monitoring a range of in vitro and in 
vivo biological activities which include specific molecular 
processes in cells, cellular physiology, and the detection, 
quantification and/or purification of a target ligand from a wide 
variety of samples; and 

(iii) use as a substitute for reporter-molecule labeled . 
monoclonal or polyclonal antibodies. 

The aFP can covalently bind a variety of molecules (e.g. 
natural, synthetic, biological, non-biological, organic, inorganic, 
protein, non-protein small or large) , and can function both as 
molecular recognition groups and as molecular biosensors which are 
capable of sensing and reporting the interaction of a binding site 
with its cognate ligand. 
Dwg.0/11 
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Fhtlp is involved in the flocculation and heat tolerance machinery 
of budding yeast Saccharomyces cerevisiae. Despite knowledge of its 
involvement in those phenotypes, a precise mechanism has yet to be 
discovered. To this end, we monitored the relationship between 
subcellular localization of Fhtlp and its flocculation or heat 
tolerance function using newly developed expression vectors with a 
recombinant green fluorescent protein 
(GFP; S65T/S147P) of Aequorea victoria added at 

both the N- and C-terminus of Fhtlp. The main fluorescent signal of 
the GFP tagged with either a wild-type Fhtlp or 
mutants which preserve their flocculation function was 
detected in the nucleus, whereas signals of functionless 
mutants were dispersed to the cytoplasm. 

Copyright 2001 National Science Council, ROC and S. Karger AG, Basel 
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A green fluorescent protein 
mutant (S147P GFP) was fused with 

protein A and expressed in Escherichia coli. This fusion protein 
(PA-GFP147) was used in immunoblotting studies as a new detection 
system, designated as "flexible single-step detection (FSSD)". In 
FSSD, the detection of blotted antigen was done in one step, and the 
antigen-antibody reaction can be monitored by UV-irradiation in real 
time. The reaction time, therefore, can be flexibly controlled by 
monitoring the green fluorescence. 
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To visualize the movements of cells and their processes in 
developing vertebrates, we constructed replication-incompetent 
retroviral vectors encoding green fluorescent 
protein (GFP) that can be detected as a single 
integrated copy per cell. To optimize GFP expression, the 
CMV enhancer and avian beta-actin promoter were incorporated within 
a retrovirus construct to drive transcription of redshifted (F64L, 
S65T) and codon-modif ied GFP (EGFP), EGFP tagged with 
GAP-4 3 sequences targeting the GFP to the cell membrane, 
or EGFP with additional mutations that increase its 
ability to fold properly at 37 degrees C (S147P or V163A, 
S175G) . We have used these viruses to efficiently mark and follow 
the developmental progression of a large population of cells in rat 
neocortex and whole avian embryos. In the chick embryo, the 
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migration and development of GFP-marked neural crest cells 

were monitored using time-lapse videomicroscopy. In the neocortex, 

GFP clearly delineates the morphology of a variety of 

neuronal and glial phenotypes. Cells expressing GFP 

display normal dendritic morphologies, and infected cells persist 

into adulthood. Cortical neurons appear to form normal local axonal 

and long-distance projections, suggesting that the presence of. 

cytoplasmic or GAP-43-tagged GFP does not significantly 

interfere with normal development. 
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Green fluorescent protein (GFP 

) from Aequorea victoria is widely used as a marker of gene 
expression and protein localization in living cells from prokaryotes 
to eukaryotes. However, the total fluorescent signal from wild-type 
GFP is very weak when expressed in cells cultured at 37 
degrees C compared to 30 degrees C or below. This characteristic 
makes GFP poorly suited to use as a marker in mammalian 
cells. Here we describe a new variant of GFP which carries 
a substitution of Serl47 to Pro (S147P 

GFP) and which emits a stronger fluorescent signal than the 
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wild-type GFP at high temperature. When S147P. is 

combined with the Ser65 to Thr mutation (S65T GFP 

) , the resulting double mutant emits fluorescence which is 

several-fold stronger than GFP with a single S65T 

modification in both bacterial or mammalian cells. This 

S147P mutation should be useful for constructing 

new GFP variants which stably emit strong fluorescence at 

a wide range of culturing temperatures. 
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